BOOTSTRAP PERCOLATION IN HIGH DIMENSIONS 
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Abstract. In r-neighbour bootstrap percolation on a graph G, a set of initially infected 
vertices A C V(G) is chosen independently at random, with density p, and new vertices 
are subsequently infected if they have at least r infected neighbours. The set A is said 
to percolate if eventually all vertices are infected. Our aim is to understand this process 
on the grid, [n] d , for arbitrary functions n = n(t), d = d(t) and r = r(t), as t — > oo. 
The main question is to determine the critical probability p c ([n] d , r) at which percolation 
becomes likely, and to give bounds on the size of the critical window. In this paper we 
study this problem when r = 2, for all functions n and d satisfying d 3> logn. 

The bootstrap process has been extensively studied on [n] d when d is a fixed constant 
and 2 ^5 r ^ d, and in these cases p c ([n] d , r) has recently been determined up to a factor 
of 1 + o(l) as n — > oo. At the other end of the scale, Balogh and Bollobas determined 
p c ([2] d , 2) up to a constant factor, and Balogh, Bollobas and Morris determined p c ([n] d , d) 
asymptotically if d ^ (loglogn) 2+e , and gave much sharper bounds for the hypercube. 

Here we prove the following result: let A be the smallest positive root of the equation 

2^ 2 k2 ~ k k\ 

k=0 

so A ss 1.166. Then 

if d is sufficiently large, and moreover 

Pc ([n] d ,2) = (4A + o(l)) (^) 2 ^2-V^ 
as d — > oo, for every function n = n(d) with d 3> logn. 



1. Introduction 

Bootstrap percolation is a cellular automaton, in which an infection spreads (on a graph 
G, with threshold r e N) according to the following deterministic rule: a vertex with at 
least r infected neighbours becomes infected, and infected sites are infected forever. To 
be precise, let A = A C V(G), let 

A t+1 := A t U {v E V(G) : |I» H A t \ > r) 
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for each t G N = {0, 1,2, . . .}, and let [A] = \J t A t . We say that A percolates if [A] = 
V(G), i.e., if every vertex is eventually infected. 

The bootstrap process is closely related to the Ising Model of ferromagnetism, and was 
first introduced in 1979 by Chalupa, Leath and Reich [17] in the context of statistical 
mechanics. The main question is as follows: Suppose that the elements of the set A are 
chosen independently at random with probability p. For which values of p is percolation 
likely to occur? More precisely, writing P p for the Bernoulli distribution with density p, 
and given a graph G, a threshold r G N = {1, 2, . . .} and a G [0, 1], define 

p a (G,r) := inf |p : P p (A percolates in r-neighbour bootstrap) ^ aj. 

We shall write p c (G, r) for pi/2{G, r), the critical probability for percolation. Our aim is to 
give sharp bounds on p c (G, r), and to determine the size of the critical window, pi_ £ —p £ . 

This problem has been extensively studied on the grid [n] d , with d and r fixed. In 
particular, Schonmann |33j showed that p c (Z d ,r) G {0,1} for all d and r, Aizenman 
and Lebowitz [3] determined p c ([n] d ,2) up to a constant factor, and Cerf and Cirillo [15] 
(for d = r = 3) and Cerf and Manzo [16] did the same for p c ([n] d ,r) for all 2 ^ r ^ 
d. The result of Cerf and Cirillo was a particularly important breakthrough, since the 
problem becomes much more difficult when r ^ 3; in Section [6] we shall see that a similar 
phenomenon also occurs on the hypercube. In another significant breakthrough, the first 
sharp threshold for the critical probability was proved by Holroyd [25], who showed that 



The sharp threshold for all d ^ r ^ 2 was recently determined by Balogh, Bollobas 
and Morris [7J (for d = r = 3) and Balogh, Bollobas, Duminil-Copin and Morris [5] (in 
general) . 

Having proved such sharp bounds for d fixed, it is natural to ask the question: what 
happens when n, d and r are all allowed to tend to infinity? In general, given arbitrary 
functions n — n(t), d — d{t) and r = r(t), with n + d — 7-ooast— y oo, we would like to 
determine p c ([n] d ,r), and show that percolation has a sharp threshold. 

A first step in answering this question was provided by Balogh and Bollobas [1], who 
determined the critical probability for two-neighbour bootstrap percolation on the hyper- 
cube up to a constant. They proved that 



Balogh, Bollobas and Morris [5] also studied the majority (i.e., r = \d/2\) bootstrap 
process on the hypercube. They gave very sharp bounds on the critical probability, and 
moreover used the techniques developed in their proof to show that 



as d — y oo if n = n(d) satisfies d ^ (loglogn) 2+e . (As a consequence of Schonmann's 
proof in [33J, we have p c ([n] d , d) = o(l) if d ^ log* n.) Morris [28] also used the techniques 





Pc([n] d ,d) = l + o(l) 
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of [5] to prove that, in zero-temperature Glauber dynamics on 7L d , the critical threshold 
for fixation converges to 1/2 as d — > oo (see also [22J). 

In this paper we shall determine a sharp threshold for the critical probability in two- 
neighbour bootstrap percolation on the hypercube, and more generally on all sufficiently 
high- dimensional grids [n] d . We shall moreover give fairly tight bounds on the second 
term of p c ([2] d , 2). Throughout the paper, let A ~ 1.16577 denote the smallest positive 
root of the equation 

00 ( -\\ k \ k 
2 k2 ~ k k\ 

k=0 

The following theorem is our main result. 

Theorem 1. If d is sufficiently large, then 

16A ^ , logd\ o _ 2v ^ d 16A / 5{logd) 2 \ 2y/ - d 



* v 1+ ^J 2 * M2| - 2) * ^l 1+ ^rJ 2 • 

where A ~ 1.166 is as defined above. 

We shall also determine a sharp threshold for p c ([n] d ,2) for all high-dimensional grids 
[n] d . Here, and throughout, log is to the base 2, unless otherwise stated. We write / 3> g 
to indicate that g(d)/f(d) — )■ as d — >■ oo. 

Theorem 2. Let n = n(d) be a function such that d ^> logn ^ 1 as d — >• oo. Then 
Pc ([n} d ,2) = (4A + o(l)) (^y) 2 ^- 2 ^^, 

as d — >■ oo, where A ~ 1.166 is as defined above. 

In fact (see Theorems 1321 and 138]) . if d ^ jt, 16+c for some constant c > then we shall 
prove bounds on the second order term oi p c ([n] d ,2) similar to those in Theorem [TJ 

Remark 1. On the torus /L d , with n ^ 4, straightforward calculations suggest that the 
critical probability is the following, slightly different function: 

Pc(Zi,2) = 1^11 2 - 2 ^. 

The upper bound follows by the method of Section HI but the simple coupling we use to 
prove the lower bound (see Lemma [341 is no longer valid for the torus. The lemma should 
still be true for n ^ 4; for n = 3, on the other hand, the lemma is false and the critical 
probability is very different. See Section |6] for further discussion of these issues. 

The hypercube is a very well-studied combinatorial object; for example, see the classical 
work (relating to a different percolation problem on the hypercube) of [201 El [13] , or the 
more recent improvements [TH EH [25]. We also note that the bootstrap process has been 
studied on other graphs, such as infinite trees [H1ETJE], the random regular graph [T0[|27]. 
and a more general class of 'locally tree-like' regular graphs j5]. Some very recent results 
on bootstrap percolation in two dimensions can be found in [23J and [181 . For more on 
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the links between bootstrap percolation and statistical physics, see ESI ED], or the 
survey pp. 

The main step in the proof of Theorems [1] and [2] is Theorem [3l below, which gives very 
tight bounds on the probability that a small hypercube (a 'critical droplet') is internally 
spanned (see below). In order to state this result, however, we first need the following 
definitions. 

Let d and n = n(d) be fixed, and consider two-neighbour bootstrap percolation on [n] d ; 
that is, from now on we assume that G = [n] d and r = 2. We shall call a set which is 
isomorphic to [ai] x • • ■ x [a^] for some a±, . . . , £ N a cube, and a set which is isomorphic 
to [2] £ for some £ £ No a hypercube. Note that given any set A C [n] d , the set [A] is a 
disjoint union of cubes at distance at least 3 from each other. 

Given a cube Q = [ai] x . . . x [aj C [n] d , define the dimension of Q to be 

d 

dim(Q) := X)(ai-1), 

8=1 

and note that a single vertex is a cube of dimension zero. We write d(S, T) for the distance 
(in the graph [n] d ) between two sets S and T. 

We say that the cube Q is internally spanned by a set A C [n] d if [A fl Q] = Q. For 
hypercubes, the following more restrictive definition will play a crucial role in the proof. 

Definition. Let £ £ N . A cube Q = [2] 2£ is said to be sequentially spanned by a set 
A C Q if \A\ — £ + 1, and there exists an order (do, • • • , ai) of the elements of A such that 

d(% + i, [{ao, • • •,%}]) = 2 

for each ^ j : ^ i — 1. In particular, note that [{ao, . . . , a 3 }] is a subcube of dimension 
2j for each j £ [£]. We call such an ordering (a , ...,%) a spanning sequence for Q. 

We say that Q = [2] 2e is sequentially internally spanned by a set A C if it is 
sequentially spanned by A HQ. 

Given a set S, we write A ~ Bin^, p) to mean that the elements of A C S 1 are chosen 
independently at random with probability p. Recall that P p denotes the distribution 
A ~ Bin([n] d ,p). Now, for each £ £ N define 

P(£,p) := P p (A internally spans [2]^), 

and 

Q(2£,p) := P p (A sequentially internally spans [2} 2e ). 
Balogh and Bollobas [4J proved that, if 2 e p ^ 1, then 

^/4-2y* + 3)/2 ^ p ^ p) ^ e e 2 ey4 p{ e + 2)/2^ 

and used this result to determine p c ([2] d ,2) up to a constant factor. An error of order 
2 M in the approximation of P(£,p) corresponds to a multiplicative constant error in 
the calculation of p c . Thus for Theorem [2] we need to determine P(£,p) up to a factor 
of 2°( £ \ and for Theorem [1] we are allowed only a factor of £°^°^ l \ In fact we shall 
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do somewhat better even than this. The following theorem determines P(£,p) up to a 
constant multiplicative factor; for even £ this constant is 5/2. 

Theorem 3. There exists a constant 5 > such that the following holds. Let £ G N and 
p > 0, with £ 2 2 2e p ^ 5, and let A ~ 1.166 be as defined above. Then 

(a) -(2£)\\- e 2 e2 p e+1 ^ Q(2£,p) ^ P(2i,p) < (2£)\ X~ e 2 e p i+1 . 
5 

(5) _L( 2 £+i)!A- £ 2( m V +2 < P(2£+l,p) ^ h{2£ + 1)! X~ e 2 (m) V +2 - 

It is interesting to note that, since our proof is by induction, and is extremely delicate 
in places, we could not have proved a much weaker result than Theorem [31 In particular, 
our proof does not work if the induction hypothesis has a large multiplicative constant 
error term, except in the lower bound of part (b). 

The rest of the paper is structured as follows. First, in Section [2] we state a technical 
lemma, and make some definitions. Section [3] contains the proof of Theorem [3] and is 
the main part of the paper. In Sections H] and [5] we shall deduce Theorems [TJ and [5] from 
Theorem [31 and in Section [6] we state some open problems. Finally, in Appendix A we 
prove the technical lemma. 



2. Tools and Notation 

In this section we shall state our main technical lemma, and recall some basic facts 
about bootstrap percolation. The proof of Lemma H] is somewhat lengthy, and is given in 
Appendix A. 

For each m G N, let 

(2A) m 



2 m2 m\' 

oo 

and note that ^^(— l) m+1 a m = 1, by the definition of A (see (QQ), above). 



(2) 



m=l 



Lemma 4. Let I G N, and let g,h : No — > M + be non-negative functions satisfying 
g(0) = 0, and 1 ^ h(m) ^ 1 + g(m) for every m G No, and let A ~ 1.166 and (a m ) be as 
defined above. Suppose that /(0) = 1, /(l) = A/2, and 



m) 

m=l 



f(t) = Y,{-^) m+1 a m h{t-m)f{t 
for every 2 ^ t ^ £. Then 

m=l 



1 
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Remark 2. The bounds in Lemma H] are close to best possible. To see this, suppose (for 
simplicity) that we replace the sequence (ax, 0,2, . . .) by the (ideal) sequence (1, 0, 0, . . .). 
Then f(i) — A/2 if h(m) = 1 for every m G N, and 



if h(m) = 1 + g(m) for every m G N. 

We next recall the concept of disjoint occurrence of events, and the van den Berg-Kesten 
Lemma [11], which utilizes it. In the setting of bootstrap percolation on a graph G, two 
increasing events E and F occur disjointly if there exist disjoint sets S,T G V(G) such 
that the infected sites in 5* imply that E occurs, and the infected sites in T imply that F 
occurs. (We call 5* and T witness sets for E and F.) We write E o F for the event that 
E and F occur disjointly. 

The van den Berg— Kesten Lemma. Let E and F be any two increasing events defined 
in terms of the infected sites A C V(G), and let p G (0, 1). Then 



We next state a fundamental lemma from [I] , which allows us to apply the van den Berg- 
Kesten Lemma. An almost identical lemma was proved independently by Holroyd [26] 
for the two-dimensional lattice. 

Lemma 5. Let Q C [n] d be a cube, and suppose Q is internally spanned by A C [n] d . 
Then there exist proper subcubes S,T C Q such that 

• [S U T] = Q, and 

• S and T are disjointly internally spanned by A. 

In particular, we may take S to be the largest internally spanned proper subcube of Q. 

We give a sketch of the proof to aid the reader who is unfamiliar with these concepts; 
for the details, see the proof of Lemma E2J which is a slight generalization. 

Sketch of proof. Given two subcubes, S and T, the span of their union, [S U T] is either 
just their union, S U T, or a new (larger) subcube. Thus, if A spans Q, we may consider 
the process as a series of moves of the form (S,T) h-> [5 U T], where [S U T] is a cube, 
starting with all single sites (dimension zero subcubes), and ending up with Q. At each 
stage, all of the subcubes are disjointly spanned. The subcubes S and T given by the 
lemma are those found at the penultimate step of this process. □ 

The next lemma is also from [1], and gives the minimal number of sites in a spanning 
configuration. Given a cube Q = [a±\ x . . . x [ad] C [n] d , recall that the dimension of Q is 




F p (EoF) < ¥ P (E)F P (F). 



d 

dim(Q) = ^(oi-1). 
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Note that if S, T and Q are cubes, and [SUT] = Q, then dim(S') + dim(T) ^ dim(Q) —2. 
This follows because the distance between S and T must be at most two (since their 
closure is a cube), and so they must overlap in all but at most two directions. 

Lemma 6. Let Q be a subcube of [n] d . If A spans Q, then 

\A\ > ^ + 1. 
1 1 2 

In particular, if A spans [2] 2£ then \A\ ^ £ + \, and if A spans [2] 2£+1 then \ A\ ^ £ + 2. 

Proof. The proof is by induction on the dimension of Q, using Lemma [5j Indeed, let S 
and T be disjointly internally spanned subcubes with [S U T] = Q, and let Ai,A 2 C A 
with L4 X ] = 5, L4 2 ] = T and Ai n A 2 = 0. Since [SUT} = Q, we have dim(S) + dim(T) ^ 
dim(Q) — 2. Hence 

i^noi > i^i+w > d " n(s) ; dim(r) + 2 > 

as required. □ 

We finish this section by describing some of the notation we shall use throughout the 
paper, and especially in Section |3j Given a subcube Q C [2] d , we shall denote Q as a 
member of {0, 1, *} d in the obvious way, i.e., if Q = (z±, . . . , z<i) G {0,1, *} d , then 

(Vi, ■■■,yd) e Q ^ = for every j with ^ e {0, 1}. 

We shall denote by Q(ji, ■ ■ ■ , jk) the collection of maximal subcubes of Q which are 
constant on {ji, . . . ,jk}, and by Q[ji, . . . ,jk] the collection of maximal subcubes of Q 
which are constant on {j h . . . ,j k } c = [d] \ {ji, . . . ,j k }. 

Given subcubes B, C C [2] d , we write A(B, C) for the set of directions in which B and 
C are both constant and differ, and given vertices b, c G [2} d , define A (6, c) and A (6, C) 
similarly, noting that a vertex is a cube of dimension zero. Observe that d(B, C) = 
\A(B,C)\. Finally, note that N= {1,2,...} and N = {0,1,2,...}. 

3. Percolation in [2] d 

In this section we shall prove Theorem [3j The proof comes in two parts: first we bound 
Q(2£,p) from both sides, and then we bound P(£,p) from above. The methods in each 
case will be similar, but for P(£,p) we shall have many more complications to overcome. 

3.1. Bounding Q(2£,p): a lower bound for P(2£,p). We begin with the simpler case; 
not only will it serve as a helpful warm-up, but also this result will be a crucial tool in 
the proof of the harder case, and in Section 0] (see in particular Lemmas |2U1 [2T] and UE\ 
and Proposition [T6l). 

Let £ G N , and suppose that the cube C = [2} 2e initially contains exactly £ + 1 active 
sites. Let A(£) denote the collection of all such (£+ l)-sets, and let S{£) C A(£) denote 
the collection of those (£ + l)-sets which sequentially span C. We shall prove the following 
theorem. 
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Theorem 7. Let leN, and let A ~ 1.166 be as defined in (0Q). Then, 

A ^ / c/*\l / ^ /o«\i \-<o< 2 



1 - - (2£)!A" £ 2 £ < |«S(£)| ^ -(2£)!A- £ 2 
2 / 2 



Hence, if 2 p is sufficiently small, then 

-(2£)\\- £ 2 e2 p l+1 < Q(2£,p) < -(2£)!A^2V +1 - 
5 5 

We start by covering S{£) with sets N(j,k) as follows. For each A G «S(£) and each 
1 ^ j < k ^ 2£, say that the pair {j, k} is an ending for A if some A' C A sequentially 
spans one of the subcubes C(j, k). Note that if {j, k} is an ending for A, then j and k are 
the last dimensions to be infected for some spanning sequence for C, and that = 1. 

Now, for each 1 ^ j < k ^ 2£, define 

N(j, k) = Ng{j, k) = {Ae S(£) : {j, k) is an ending for A], 

and observe that S{£) = N(j, k). We shall count S{£) by inclusion-exclusion, using 

j<k 

the sets N(j, k). The following lemma is the key step. 

Lemma 8. Let £, m e N with £ ^ 2, and let {ji, ki}, . . . , {j m , k m } C [2£] be distinct pairs. 
If the elements j q and k q are all distinct, then 



q=l 



2 2m (2 2e - 2m ) m \S(£-m) 



and otherwise f] q=l Ne(j q , k q ) = 0. 

We shall use the following two simple lemmas in the proof of Lemma We shall prove 
a slightly more general version of each that we need here, since we shall use them again 
in the proof of Lemma [15] in Section 13.31 We need the following definition. 

Definition. Given a cube Q C [2] d and a set A C Q, we say that {j, k} is a final pair 
for A in Q if A internally spans some subcube C G Q(j, k), and internally spans Q, and 
\A\ C\ = 1. We say that {i} is a final element for A in Q if A internally spans one of the 
subcubes C G Q(i), and internally spans Q, and \A \ C\ = 1. 

Note that if {j, k} is an ending for A, then it is also a final pair for A in Q (but not 
vice- versa, since the former has the extra condition that C is sequentially spanned). The 
following lemma says that final pairs and final elements are all disjoint. 

Lemma 9. Let {j, k} be a final pair for A in Q. Then j and k are not final elements for 
A in Q. Moreover, if{j',k'} is a different final pair, then j , k, j' and k' are all distinct. 

Proof. First suppose that both {j, k} and {j, k'} are final pairs for A in Q, and for sim- 
plicity assume that j — 1, k — 2 and k' = 3. Then A internally spans a subcube in 
Q(l, 2), without loss of generality suppose it is C — (0, 0, *, . . . , *), and A\C consists of a 
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single element, which lies in the cube (1, 1, *, . . . , *). In particular, note that the subcubes 
(0, 1, *,..., *) and (1, 0, *,..., *) contain no elements of A. 

Now, A also internally spans some subcube D in Q(l,3). But one of the sets D fl 
(0, 1, *,..., *) and D D (1, 0, *, . . . , *) is a subcube of D, of dimension dim(D) — 1, which 
contains no element of A (the other is the empty set). Thus A cannot span D, and so we 
have a contradiction. It is similarly easy to see that if {j, k} is a final pair, then {j} is 
not a final element for A in Q. □ 

The next lemma will allow us to recursively obtain a sequence of sequentially internally 
spanned subcubes of a sequentially spanned hypercube Q. 

Lemma 10. Let £ G N, let A C Q = [2] , and suppose that A spans Q. Let S be an 
(£ — 2) -dimensional subcube of Q, with \A\S\ — 1. Then S is internally spanned by A. 
Moreover, if Q is sequentially spanned by A, then S is sequentially spanned by An S . 

Proof. Let 5* C Q be a cube with dim(S') = i — 2, and let v be the unique element of 
A \ S. We may assume, without loss of generality, that S = (0,0,*,...,*), and that 
v = (1, 1, 0, . . . , 0). The basic fact we shall use is that, for any subcube T C S, 

[TU{v}]r\S = T, 

which follows because if d(v , T) ^ 2 then we must have (0, . . . , 0) G T. 

Now suppose A spans Q, and let [A D S] = 7\ U . . . U T t , where the Tj are all subcubes 
of S, and d(Ti, Tj) ^3 for every i ^ j. If A D S does not span S then t ^ 2. Note that 
d{Tj, v) *C 2 for at most one j G [t], and that [T i U{u}]n5' = Tj. Thus d([Ti], [7)U{u}]) ^ 3 
for every i ^ j. Therefore A does not percolate in Q, which is a contradiction. 

Next suppose that A sequentially spans Q, let (a , . . . , a^) be a spanning sequence for 
Q, so 

d(aj-+i, [{clq, . . .,dj}]) = 2 
for each ^ j ^ £ — 1, and suppose that v = a q . We claim that 

(a , • • • 

is a spanning sequence for S. Indeed, let Uj = [{a , . . . , a, } \ {a q }] for each j G 
and note that Uj C 5 and [Uj U PI S — Uj. Also a J+ i G 5 for every j + 1 ^ q, 

so d(dj + i,Uj) = d(aj +1 , [Uj U {ctq}]) = 2. Therefore A \ {a q } sequentially spans S, as 
claimed. □ 

The following structural lemma follows from Lemma [TTJJ We shall use this result in the 
proof of Lemma [U below, and also in Section 13.31 

Lemma 11. Let £, m G N, with £ ^ 3 and £ ^ 2m, and let A C Q = [2] e . Suppose that 
A spans Q, and that {2q — 1, 2q} is a final pair for A in [2} e for each q G [m]. 

Then there exists a subcube C G C(l, . . . , 2m), and a set A' = A \ {ai, . . . , a m }, such 
that 

(a) A' spans C , 

(b) A(a g , C) = {2q - 1, 2q] for each q G [m] . 
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Moreover, if {2q — 1, 2g} is an ending for each q G \m\, then A 1 sequentially spans C . 

Proof. For any final pair {j, k} for A, there is a vertex a G A such that j, k G A(a, c) for 
every a ^ c e A. Since \A\ ^ \£/2] + 1 ^ 3, this element is unique. Let a q be this vertex 
for the pair {2q — 1, 2g} for each g G [m]. 

First note that the a q are all distinct. Indeed, if a\ = a 2 say, then the other elements 
of A must all lie in a (2£ — 4)-dimensional subcube of C at distance 4 from ai, and so A 
would not span C. 

Thus we may apply Lemma [10] for each vertex a q in turn, obtaining a sequence of 
subcubes C = C D G\ D . . . D C m , such that dim(C g ) = 2£ — 2q for g G [m], and A fl C g 
spans C g . Let C = C m , and i' = An C. 

Finally, observe that A(a g ,C) = {2q — l,2q} for each q G [m]. This follows because 
A' is non-empty, so for any g^rG [m] we have 2r — 1, 2r G A(a r , a') fl A(a r , a 9 ) for any 
a' G A', which means that 2r - 1, 2r ^ A(a 9 , a'), and so 2r - 1, 2r ^ A(a ? , C"). 

The proof that A' sequentially spans C, if each {2q — 1, 2q] is an ending, is exactly the 
same, using the corresponding part of Lemma [10J □ 

We can now prove Lemma [HJ 

Proof of LemmalEl First note that if A 6 N(j,k) then {j, k} is a final pair for A in 
C = [2] 2£ , so it follows that N(j, k) n iV(j, jfe') = for k ^ jfe' by Lemma EE So let 
ji, . . . ,j m , hi, . . . , k m be distinct elements of [2£], and assume for simplicity that in fact 
j q = 2q — 1 and A; 9 = 2q for each g G [m] . 

Suppose that A G HoLi ^0?) ^g)> so {^^ — 1 ; 2g} is an ending for A for each q G [m]. 
Then, by Lemma [Tl] there exists a subcube C G C(l, . . . , 2m), and a subset A' C A, such 
that A' sequentially spans C, and A \ A' — {a±, . . . , a m }, for some a q with A(a q , C) = 
{2q — 1, 2q} for each g G [m). 

Now we only have to count. We have 2 2m choices for C and, given C, \S(£ — m) \ choices 
for A' and 2 2e ~ 2m choices for each a q . Finally, we claim that each of these choices gives 
a different set A. Indeed, if £ > m then dim(C') ^ 2, and so C is the only member of 
C(l, . . . , 2m) with more than one element (the others are either empty, or contain exactly 
one a q ). Thus we can reconstruct C, A' and from A. If £ = m then C is a 

single vertex, but is the only element of A at distance two from all of the others, since 
d(a q , a r ) ^ 4 for each q, r G [m]. Note that here we needed m = £ ^ 2. □ 

Observation 12. Let £, m G N wift £ ^ m. There are 

_L_ j=r /2£ - 2g\ _ (2£)! 

V 2 J _ 2 m m!(2£- 2m)! 

different ways of choosing m disjoint pairs from [2£] . 

Now, using Lemmas H] and El we can prove Theorem [71 

Proof of Theorem [7| The proof is by the technical lemma, Lemma [H Define a function 
g : N -> M+ by 

|5(£)| = (?(£)(2£)!A^2^, 
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for each £ £ No, and observe that therefore 



'm-l 

n 

.9=0 



2£-2q 
2 



2 2ern - m2+m \S{£-m)\ = g{e-m){2£)\\- e+m 2 E2 . 



Recall that S(£) = N(j, k). Therefore, by inclusion-exclusion, Lemma [S] and Observa- 

j<k 

tionfTSl we have 



\S 



£(-ir +1 E 

Ol.fel},— ,{jm,km} 



m=l 



q=l 



m=l \q=0 V 7 / 



)2^m-2m 2 +2m 



S{£ - m) 



f 2 i 

y^(_l)m+if ( g (£_ m u 2 £y.X- e+m 2 i2 

■ / -^ m! V 

g(£-m). 



m=l 



So, recalling that a r 



(2£)\\- e 2 e2 Y,(-~L) m+1 y 
(2A) 



m=l 



/(2A) m 
2 m2 m! 



, , , it follows that 
2 m2 m! 

1 



m 



(3) 



for every 2 ^ £ £ N. Moreover, note that |«S(0)| = 1 and |«S(1)| = 2, so it is easy to check 
that g(0) = 1 and g(l) = A/2. Thus, by Lemma HJ applied with h(m) = 1, we obtain 

1 - ^ m < 

as required. 

Finally note that, for each A £ S(£), the probability that the set of initially active sites 
in [2] 2e is exactly A is p e+1 (l — p) 22e ~ l ~ l . Hence 

j _ A Q(2l,p) A 

2 ^ (2£)!A-^2V +1 (1 -p) 22 '^ 1 " 2' 
The bounds for Q(2£,p) now follow if 2 2£ p is sufficiently small, since A < 6/5. 



□ 



Remark 3. Theorem [7] determines \S(£)\ up to a factor of about 3/2, but in fact the 
proof allows us to calculate \S(£)\ exactly for any fixed £, via the recursion ((3]). Computer 
simulation indicates that the correct constant g(£) is approximately 0.4976 for large £. 
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3.2. Bounding P(£,p): motivation and definitions. We now move on to the harder 
part of the proof: bounding P(£,p) from above. There are several ways in which [2] e 
may be spanned, and in order to deal with them (by induction on £) we shall need very 
sharp bounds on P(£,p). We overcome these problems by defining a new function, R(£,p), 
for which we obtain a precise recurrence relation (see Lemma [H]). We then prove that 
R(£,p) ~ P(£,p) (see Lemma [TTj) using the induction hypothesis, which allows us to apply 
the technical lemma. We deduce that sequential spanning is (roughly) the dominant kind 
of internal spanning when £ is even and p is not too large. 

Let us begin by defining a collection of events M(j, k), which will serve the same purpose 
here as the sets N(j, k) did above. Recall that, given a cube Q and a set A C Q, we 
say that {j, k} is a final pair for A in Q if A internally spans Q, A internally spans some 
subcube C G Q(j, k), and | A \ C\ = 1, and that {i} is a final element for A in Q if A 
internally spans Q, A internally spans one of the subcubes C G Q(i), and \A \ C\ = 1. 

In all that follows, we shall assume that the elements of A are chosen independently at 
random with probability p. Now given leN, and i, j, k G [£] with j ^ k, let 

L{i) = Li(i) := the event that {i} is a final element for A in [2] e , 

and 

M(j, k) = Mi(j, k) := the event that {j, k} is a final pair for A in [2] £ , 



and note that P(£,p) ^ P p ( (J M(j, k) U (JL( 



Now, for each £ G N and p G (0, 1), let 

R(2£,p) := F p l\jM 2e (j,k) 

\j<k 

and let 

i?(2£ + l,p) := Pj |J M 2m (j, fc) U |J W< 

\j<fc i 

and note that Q(2£,p) ^ R(2£,p), and that i2(^,p) ^ P(€,p) for each £ G N . 
We now state the theorem we shall in fact prove by induction on £. 

Theorem 13. Let 5 > be sufficiently small. Then there exists a non-negative function 

g : No — > M + , wit/i ^(w,) ^ -, suc/i that 



m=0 

t-l 



R(2£,p) ^ ^exp(^-^^(m))(2£)!A^2V +1 (l-p) 



2 2l -£-l 



m=0 



for every £ G N and p > u>rf/i £ 2 2 2£ p ^ 5. under the same conditions, 

R(2t+l,p) ^ 4(2£+l)!A- £ 2^ +1 V +2 - 
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Moreover, 

P(2£,p) ^ (l + g(£))R(2£,p) 

and 

P{2£ + l,p) ^ 5(2£+l)!A-'2 (m) V +2 

for every leN. 

The upper bounds in Theorem [3] follow immediately from Theorem [T2J Indeed, first 
note that 

1 + g(£) ^ e 9 ^ < exp ^ 9 ^ 



2 - A 



since A > 1 and g(£) ^ 0. Hence 



P(2£,p) ^ ^expU^^m)] (2£)\ X~ e 2 g2 p i+1 (1 - p) 

\ m =0 J 



and 

r xp (rA m t;( m )) < r xp (^A)) < 1; 

since A < 6/5. We remark that with a little more care, we could have improved this upper 
bound by a factor of about 4/5. 

We shall need the induction hypothesis in order to prove several of our preliminary 
lemmas. So, for ease of reference, we shall refer to the following conditions as the properties 
(*) for L. 

(a) If 2£ «C L then 

P(2£,p) ^ (2£)\\- e 2 e2 p e+1 (l-pf e - e - 1 . 

(b) If 2£ + 1 < L then 

P(2£+l,p) ^ 5(2£+l)\\- e 2^ 2 p e+2 . 
Observe that the properties (*) for £ imply that 

P{£,p) ^ £\\- l / 2 2 l2 iy' 2+1 (4) 

if 2 e p 1/100, say. 

3.3. Recurrence relations. We shall bound R(£,p) using inclusion-exclusion, induction, 
and the technical lemma. We shall have to treat the odd and even cases separately. The 
following lemma gives us the recursions we need; combining them with Lemma [171 (below) 
will give the desired result. 
For every £ e N , let 

z{2£) = {2£)\X' i 2 i2 . (5) 

Also, here and throughout, let 

^ - (I - P f- 2i - 2m - m (6) 



14 



JOZSEF BALOGH, BELA BOLLOBAS, AND ROBERT MORRIS 



Lemma 14. Let 2<IgN and p > 0. Then 

R(2£,p) = Y,(-l) m+1 a m p m S 2e , m , p ^^P(2£~2m,p), 



m^l 

and 



2£+l 



R(2£+l,p) <: (2£+l)2 2e+1 pP(2£,p) + I ^ j2 2e+1 p P(2£ - l,p). 

Lemma [141 will follow fairly easily from the following lemma, which plays the same role 
in the current proof as did Lemma [S] in the previous subsection. We remark that we could 
obtain a similar recurrence for f] q M(j q , k q ) fl L(i), but we shall not need this. However, 
the situation for L{i) fl L(j) is more complicated (see Proposition [16]) . 

Lemma 15. Let £ ^ 3 and m ^ 1 fee integers, let {jx, k{\, . . . , {j m , k m } C [£} be distinct 
pairs, and letp>0. If the elements j q and k q are all distinct, then 



F p lf]M{j q ,k g )) = 2 2m (2 e - 2m ) m p m S i<m!P P(£-2m,p), 



and otherwise F p [CX^Li Mi(j q , k q )j = 0. 

Proof. The proof is almost the same as that of Lemma First note that we have 

F p (M(j,k)nM(j,k')) = 

if k 7^ k' by Lemma [9j 

So let C = [2} e , let {ji, ki, . . . , j m , k m } be a set of 2m distinct elements of [£}, and 
assume for simplicity that in fact j q = 2q — 1 and k q = 2q for each q G [m] . Suppose A 
is such that the event DgLi M(j qi k q ) holds. Then, by Lemma [TTl there exists a subcube 
C G (7(1, ... , 2m) and a set A' C A, such that A' spans C, and A\A' = {ax, . . . , a m } 
for some a q with A(a q , C) = {2q — 1, 2q} for each q G [m]. 

Now we only have to count. We have 2 2m choices for C and, given C, 2 l ~ 2m choices for 
each a q . The probability that C is internally spanned is P(£ — 2m,p), and the probability 
that A\C — {a x , . . . , a m } is exactly p m (l - p )2 £ '-2 £ '- 2m -m = p m g^ 

Finally, we claim that each of these choices gives a different set A. Indeed, if £ > 2m 
then dim(C') ^ 1, and so C is the only member of (7(1, . . . ,2m) with more than one 
element (the others are either empty, or contain exactly one a q ). Thus we can reconstruct 
C, A 1 and from A. If £ = 2m then C is a single vertex, but is the only element 

of A at distance two from all of the others, since d(a q ,a r ) ^ 4 for each q,r G [m]. Note 
that here we needed m = £/2 ^ 2. □ 



We can now prove Lemma [T] 

Proof of Lemma 14 ■ First consider the even case. By Observation [12], there are 

1 (2£~2q\ (2£)\ 
ffllJJJ 2 J ~ 2 m m\(2£-2m)\ 
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ways of choosing m pairwise disjoint pairs {ji, ki}, . . . , {j m , k m } C [2£] with ji < h for 
each i. Observe also that 

2 m (2l)! ( 2e . 2m] m z(2£) 
m\(2£-2m)\ { > m z{2£ - 2m)' 

Thus, by the inclusion-exclusion formula and Lemma [151 we have 

U m 2 ,(j, k) = ]T(-ir +i £ p P n M 2e ( 3q , k 

j<k / m=l {il,fcl},-,{im,fem} \9=1 

(2^)! 



£(-ir +1 



2 m m!(2£-2m)! 



2 2m (2 2£ - 2m ) m p m ^, m , p P(2£ - 2m, p) 



= ^(-l) m+1 a m ^ ( ^ ( ^ m) p m g 2 ,, m , p P(2^ - 2m, p) 

as claimed. The odd case is easier. Note first that, by the definitions, we have 

F p (L 2e+1 (i)) ^ 2 2t+1 pP(2£,p) 

for each i G [2£ + 1] , and 

F p {M 2e+1 (j,k)) ^ 2 2e+1 pP(2£-l,p) 
for each j, k G [21 + 1]. Thus, by the union bound, 

R(2£ + l,p) ^ £> p (L 2m (i)) + £> p (M 2€+1 (j,fc)) 

< (2£+l)2 2m pP(2£,p) + ^ 2 hl ^2 2£+1 p J P(2£-l,p), 
as required. □ 

3.4. A lower bound for R(2£ + l,p). In this section we shall again use inclusion- 
exclusion, together with some careful counting, in order to prove a lower bound on 
R(2£ + l,p) which is sharp up to a constant multiplicative factor. This lower bound 
will play a crucial role in Section 13.5} below. We remark that in order to prove the propo- 
sition we need the full strength of Theorem[7]- if our bounds on Q(2£, p) were a bit weaker 
(say, if the ratio between the upper and lower bounds were 5/2) then the proof would not 
work. Since we need this lower bound on R(2£ + l,p) in order to prove Lemma [171 below, 
in some sense the entire proof of Theorem [3] rests on the following calculation. 

Proposition 16. Let 5 > be sufficiently small, and let £ G N and p > be such that 
£ 2 2 2l p < 5. Then 

R(2£+l,p) > -L(2£+l)!A-^ +1 V +2 - 
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Proof. Recall that, by Theorem we have 

-(2£)\\- e 2 £2 p e+1 < Q(2£,p) ^ -(2£)\\- e 2 e2 p e+ \ 
5 5 

since 2 2l p ^ S is sufficiently small. Suppose first that £ ^ 19. Then we need to consider 
only the event 1/(1), which has probability at least 

2 2t+1 p{l-pf t Q{2£,p) > 0-^ (2£)\\- e 2^ 2 p e+2 , 

and 5(2 | +1) > y^q, as required. 

So suppose now that £ ^ 20. We consider those ways of spanning Q = [2] 2£+1 which 
use exactly £ + 2 infected sites, and which sequentially span a 2£-dimensional subcube of 
[2] 2£+1 . Indeed, define the event 

L'(i) = L' £ (i) = the event that {i} is a final element for A in [2]^, 

and Q(i) is sequentially internally spanned by A, 

and note that L'(i) =>- L(i). Thus, by inclusion-exclusion, 

R(2£ + l,p) > Ppf|J^ +1 (i)j 

> (2£+l)2 2 ^(l-p) 2 "Q(2^,p) - E P P ( L WiO')nL' 2m (A;)) 

(2£+l)!A^2(^)V+ 2 - ^P p (L' 2W (j)nLWiW)-(7) 

By symmetry, we simply need to bound the probability of the event L' 21+1 {1) nL' 2£+1 (2) 
from above. This is harder than it looks, however, since there are many ways in which 
such an event could occur. Let b\ and b 2 denote the unique 'last' vertices in directions 1 
and 2 respectively. Then A\{b{\ sequentially spans Q{1), so choose a spanning sequence 
(a , • • • , CLi) for Q{1) such that b 2 occurs as late as possible, i.e., let b 2 = a g and let q be 
maximal. Note that q ^ 1, since the first two terms of the sequence are interchangeable. 

Let C = [{clq, . . . , a g __i}], and note that C is sequentially internally spanned by A. Let 
dim(C) = 21 - 2m (so q + m = £ + 1), and let X = (2£ + 1)! A~ £ 2^ 2 p i+2 . We divide 
into cases as follows: 

Case 1: dim(C) =2£-2. 

Without loss of generality, let C = (0, 0, 0, *). Then bi G (1, 0, 1, *,..., *) and 

b 2 G (0, 1,1,*,..., *), since Q(l) and Q(2) are sequentially internally spanned. We have 
choices (in ©) for the pair {j, k} = {1, 2}, 2 3 (2£ — 1) choices for the cube C (given 



BOOTSTRAP PERCOLATION IN HIGH DIMENSIONS 



17 



j and k), and (2 2e 2 ) 2 choices for the vertices b\ and b 2 . Thus the probability of this case 
is at most 

^ 2 +1 j2 3 (K-l)(2 2f - 2 )Vg(2f-2,p) ^ A(2£+1)!A^ +1 2^ 1 ) 2+ V +2 = ^X. 

Case 2: dim(C) ^ 2£ - 4 and d(C, &i) = d(C, b 2 ) = 2. 

Without loss of generality, let C = (0, 0, *,..., *), and note that there are 2m + 1 
zeros. Since d(C,b x ) = d{C,b 2 ) = 2, and i e A(6;,C), we have at most (2m - l)2 2£ - 2m 
choices for each vertex bj] without loss let b 2 = a q e (0, 1, 1, . . . , 0, * . . . , *). Observe also 
that, since (a , . . . , ag) is a spanning sequence for Q(l), and 

[{a , ...,a q }] = (0,*,*,0,...,0, *,...,*), 

it follows that (0, a q+ i, . . . ,cti) is a spanning sequence for Q[4, . . . , 2m + 1] = [2] 2rra_2 . 

We have ( 2 ^ 1 ) choices for the pair {j, k} = {1, 2}, 2 2m+1 choices for the cube C 

(given j and k), and at most 

(2m - X?(2^ T +^ {^\S{m - 1) 

choices for the vertices a q+ i, . . . ,a>i, b\ and 6 2 , given C. [Here 5(£) is the function in 
Theorem [3 The factor m/2 2m_2 is due to the fact that one of the m elements of the set 
in S(m — 1) is = (0, . . . , 0). The factor 2" 1 ™ 1 is due to the fact that we know the first 
two co-ordinates of the points ciqj^i, . . . , ct£ are 0, but we have no control over their third 
co-ordinates.] 

Thus the probability in this case is at most 

m+ ^ 22m+ i p - 1 \ _ 1)2(22 ,_ 2m)m+1 (^_\ s{m _ pm+ i Q{2£ _ 2mjp)j 
which is at most 

(2* + !) ! m[2m _ 1)22 (2l-2m+l)( m+ l) (^3 ( ^ _ 2) , A - m+ l 2 ( m -l) S 



(2£-2m)!(2m-l)! 



> (^(2£-2m)!A^ +m 2^ m ) 2 )//^ 



9A 



m(2m - 1)^2 



-3m+l 



(2£+l)!A^2( m V +2 



18Am(2m- 1) 
25 ■ 2 3m 



25 

Case 3: dim(C) s= 2£ - 4 and d(C, &i) ^ 3. 

The calculation is the same as in Case 2, except we must multiply the final probability 
by Hgzi! 2 - 2mm - 2 . Indeed, let C = (0, 0, *,..., *), note that d(C,b 2 ) = 2 (since 
b 2 = a q ), so we have 2m — 1 choices for the first 2m + 1 coordinates of the vertex 
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b 2 , and without loss of generality let b 2 G (0, 1, 1, . . . , 0, * . . . , *). We again have that 
(0, a q+ i, . . . , at) is a spanning sequence for Q[4, . . . , 2m + 1] = [2] 2m_2 . 

Now, A \ {b 2 } internally spans Q(2), and d(a t , C) ^ 3 for every q + 1 ^ t ^ £, since C 
was chosen to be maximal. Thus there must be a (non-trivial) cube, internally spanned 
by {b\, a q+ x, . . . , g^}, at distance two from C. In particular, there must be two vertices of 
{&i, a q+ x, . . . , a?} which agree on co-ordinates 2m + 2, . . . , 21 + 1. 

22m— 1 

Hence the the probability in this case is at most the probability in Case 2 times - 

(due to our extra choices for the first 2m + 1 co-ordinates of the vertex b\), times (™) (for 
the choice of the two vertices from {b\, a q +i, . . . , ai\ which agree on their last 21 — 2m 
coordinates), times 2~( 2 ^~ 2m ) (due to our reduced number of choices on these coordinates). 
This gives that the probability in this case is at most 

fm\ f 2 2 - 1 \ (2£ _ 2m) A8Am(2m-l) \ m 3 
\2j\2m-l) V 25-2 3m J ^ 2 2 ^ m + 2 ' 

Finally, we sum over the three cases, and over m, which gives 

E„ /r . . . rl . x f 3X 18Am(2m — 1) v-^ ^ 3 \ 

P P W*iO)nW*)) < 20 + E 25-2^ + ^2^^ j X 

j<k \ m=2 m=2 J 

( 9 18A 5 e \ 19 
^ Uo + "25~'2^ + 2^) X < 50 X ' 

since A < 6/5 and £ ^ 20. It follows from (jTJ) that 

J2(2* + l,p) £ - H - > I ^(2£ + 1)!A^2^ 1 )V +2 , 

if 5 > is sufficiently small, as required. □ 

3.5. Relating P(£,p) and R(£,p). Recall that R(i,p) ^ P(i,p) for every feN. The 
following lemma, which will take some effort to prove, shows that this approximation 
is fairly tight. Combining Lemmas and [T7| will allow us to use the technical lemma 
(Lemma HJ), and hence to prove Theorem [TBI 

Lemma 17. Let 5 > be sufficiently small, and let i e N and p > be such that 
£ 2 2 2i p ^ 5. Suppose the properties (*) /or 2^—1 ZioZc?. T/ien 

P(2*,p) ^ fi + of^+^i2(2f,p). 



P 2 l 



Suppose the properties (*) for 2£ hold. Then 



P(2£ + l,p) < ^(2£+l,p). 
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Remark 4. We shall in fact prove an upper bound for P(2£ + l,p) of the form 



However, the result stated above is actually what we shall need in order to prove Theo- 
rem [13l 

The proof of Lemma [UJ requires several preliminary lemmas, as well as some lengthy 
(but straightforward) calculations, see [6] for the details. We shall begin with a brief 
sketch to provide some motivation. Suppose A percolates in Q = [2] e ; by Lemma [5] there 
exists a pair (S, T) of (proper) subcubes of Q such that S and T are disjointly internally 
spanned by A, and S and T together span Q. Moreover we may take S to be an internally 
spanned proper subcube of Q of maximal size. 



We break into five 


cases: 










• dim(S) =1- 


1 and 


\A 


\ 


S\ 


= 1, 


• dim(S) = £- 


1 and 


\A 


\ 


S\ 


>2, 


• dim(S) =£- 


2 and 


\A 


\ 


s\ 


= 1, 


• dim(S) = £- 


2 and 


\A 


\ 


s\ 




• dim(S) < £ - 


4. 











Note that dim(S') = £ — 3 is impossible, since if d(S, A\S) ^2, then S is not maximal. 
But T = {x G Q : d(S, x) ^ 3} is a subcube, so S U T does not percolate. 

Observe that in the first case one of the events L(i) occurs, and that in the third case 
one of the events M(j, k) holds. 

Definition. Given £ e N and p G [0, 1], and a random set A ~ Bin([2] £ ,p), we define: 

(a) X(£,p) to be the probability that there is a subcube S C [2] £ , with dim(S') G 
{£ — 2, £ — 1}, such that 5 is internally spanned by A, and \A \ S\ ^ 2. 

(6) Y(£,p) to be the probability that [2] £ is internally spanned by A, in such a way 
that no dimension £ — 1 or £ — 2 cube is internally spanned. 

(c) Z(£,p) to be the probability that some (£ — l)-dimensional subcube S C [2] e is 
internally spanned by A, and A \ S is non-empty. 

The following lemma, which gives us our basic upper bound on P(£,p), follows easily 
from the comments above. 

Lemma 18. Let leN and p G [0, 1]. Then 

P(2£,p) <: R(2£,p) + X(2£,p) + Y(2£,p) + Z(2£,p), 



Proof. Suppose that [2] e is internally spanned, and apply Lemma [51 If none of the events 
associated with X, Y and Z occur, then we may choose S so that dim(S') = £ — 2 and 
\A \ S\ = 1, and thus one of the events M(j, k) holds. 




and 



P(2£ + l,p) ^ R{2£+l,p) + X{2£+l,p) + Y{2£ + l,p). 
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Moreover, if £ is odd and the event associated with Z occurs, but that associated with 
X does not, then one of the events L(i) occurs. □ 

The rest of this section will be devoted to proving upper bounds on X(£,p), Y(£,p) and 
Z(2£,p), using the induction hypothesis (i.e., the properties (*)). 

Lemma 19. Let 5 > 0, and let £ e N and p > be such that £ 2 2 e p ^ 5. Suppose the 
properties (*) for £ — 1 hold. Then 

X(£, P ) = o(J^j R(£,p). 

If moreover £ is even, then 

Z(£,p) = o(i)R(£,p). 



Proof. We shall first bound X(£,p). There are four cases to consider: £ is odd or even, 
and dim(S') = £ — 1 or £ — 2. We shall give the details here only the case where £ = 2t + 1 
is odd, and dim(S') = £ — 1; the details of the other cases are similar, and may be found 
in [6]. 

Suppose some 2t-dimensional subcube S of [2] 2 * +1 is internally spanned, and that \A \ 
S\ ^ 2. We have 2(2t + 1) ways of choosing S, and the expected number of pairs in A \ S 
is at most ( 2 2 )p 2 . Thus, writing X±(2t + l,p) for the probability that such a subcube S 
exists, we have 

Xx(2t+l,p) < 2(2t + 1)1 \p 2 P{2t,p) < (2t + 1)! A"*2' 2+ V +3 
by the property (*)(a) for £ — 1. Recalling that, by Proposition 



we obtain 



R(2t + l,p) > T L(2t + l)!A- t 2( i+1 )V +2 , 
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X 1 (2t + l,p) < 100 (2 2t - l p)R(2t + l,p) ^ -j^R(£ } p) } 



since £ 2 2 e p ^ 5. 

Now assume that £ = 2t is even. In order to bound Z(2t,p), observe that there are 2t 
choices for the final direction, and 2 2 * choices for the position of an extra active vertex. 
Thus, by property (*)(&), 

Z(2t,p) ^ ^P P (L«) ^ 2t(2 2t p)P(2t-l,p) 

i 

< 5-(2t)!A- t+1 2* 2+2 y +2 ^ 2 -^(2 2t p)Q(2t, P ) <: ^-R(£,p) 

since £ 2 2 e p ^ 5 and A < 6/5, as required. □ 
Proving the following bounds on Y(£,p) will require more effort. 
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Lemma 20. Let p > be sufficiently small, and let £ £ N, with £ 2 2 2£ p ^ 1. Suppose the 
properties (*) for 2£ — 1 hold. Then 

Y(2£,p) <: ^R(2£,p). 

Lemma 21. Let p > be sufficiently small, and let I £ N, wii/i £ 2 2 2i p ^ 1. Suppose the 
properties (*) /or 2£ /ioW. T/iere 

F(2£+l,p) ^ ~i2(2£+l,p). 
5 

Observe also that Y(£,p) = if £ ^ 5, since if no dimension £ — 1 or £ — 2 cube is 
internally spanned, then dim(S') ^ £ — 4 for any internally spanned proper subcube S, 
and if £ ^ 5 then two cubes of dimension at most £ — 4 cannot span Q = [2] e . 

We shall first bound Y(£,p) for large £, using the properties (*), and then we shall 
count very carefully, using recurrence relations and a computer program, in the cases 
where £ ^ 50. We make no attempt to optimize the constants in the following lemma, 
since bounding Y(£,p) for £ ^ 50 requires little more effort than bounding it for £ ^ 30 
(see Lemmas [231 and [241 below). 

Lemma 22. Let p > be sufficiently small, and let £ £ N with £ 2 2 e p ^ 10. Suppose the 
properties (*) for £ — 1 hold. Then 



Y(£,p) = Ol- e )R(£,p). 

Moreover, when £ is even the constant implicit in the 0(.) term is at most 2 10 , and when 
£ is odd it is at most 2 20 . 

Proof. The proof is quite simple, but involves some lengthy calculations, see [6] for the full 
details. Suppose that [2} e is internally spanned by A; by Lemma [5] there exist disjointly 
internally spanned subcubes S and T as described in that lemma. We show that, if 
dim(S') ^ £ — 4, then the expected number of such pairs is small. 

Let 2^m^k^£ — 4, and suppose that dim(S') = k and dim(T) = m. There are 
^ k (k) wa y s to choose S, and each is internally spanned with probability P(k,p). Also, 
given S, there are at most 2 l ~ m ( ) ways to choose T, and each is internally spanned with 
probability P(m,p). Thus, by the van den Berg-Kesten Lemma, 

Y(£,p) ^ Yl 2n ~ k ~ m ( e ^(^) P ( k >P) P (™>P)- 

since the events 'S" is internally spanned' and T is internally spanned' occur disjointly. 

Now, if i is even then the properties (*) give upper bounds on P(k,p) and P(m,p), and 
Theorem [7] gives a lower bound on R(£,p). The result now follows from a straightforward 
maximization argument. 
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pW 2]+2 ^ (p)R(£,p). 



If £ is odd, we split into the cases k + m ^ £ — 1 and k + m = £ — 2, and note that in 
the latter case, at least one of k and m must be odd. Both cases now follow as before, 
using Proposition H~6l to obtain a lower bound on R(£,p). □ 

This bound is sufficient to prove Lemmas I2"U1 and I2T1 when £ is large: we need 2 l l 2 ^ 2 10 £ 2 
when £ is even, and 2 l ^ 5 • 2 20 £ 2 when £ is odd, so in fact £ ^ 50 suffices. 

Finally we deal with the small cases, which must be calculated more carefully. The 
idea is that, if p > is sufficiently small and £ ^ 50, then the contribution to P(£,p) of 
those configurations with more than \£/2~\ +1 active sites is at most 

2 e 

je/2] +2, 

This restricts the possibilities, and thus allows us to count the remaining configurations 
more accurately. 

Given £ e N and Q = [2} e , we shall write: 

(a) P*(£) for the number of sets A C Q of size \£/2] + 1 which internally span Q. 

(b) R*(£) for the number of such sets for which (if £ is even) one of the events M(j, k) 
occurs, or (if I is odd) one of the events M(j, k) or L(i) occurs. 

(c) Y*(£) for the number of sets A C Q of size \£/2] + 1 which span Q, but internally 
span no subcube of dimension i — 1 or I — 2. 

Note that P*(£) = R*(£) + Y*(£) for every £ e N, since the events corresponding to X(£,p) 
and (if £ is even) Z(£,p) require at least \£/2] +2 infected sites. 

Lemma 23. Let £ ^ 25, and let p > be sufficiently small. Then 

Y(2£,p) < ^i?(2£,p). 

Proof. Suppose that A C Q = [2] 2e spans Q, and that | A\ — £ + 1. Apply Lemma El as 
in the previous lemma, to obtain disjointly internally spanned subcubes S and T with 
dim(S') = k and dim(T) = m, where 2^m^k^2£ — 4. Since \A\ = £ + 1, we must 
have + m = 2£ — 2, and both k and m must be even. Let Jn denote the set of possible 
pairs (k, m), that is 

Ji := {(k,m) : 2 ^ m ^ k ^ 2£ — 4, /c + m = 2£ — 2, and both and m are even}. 
There are at most 2 2£ ( 2 ^ ( 2 ^ fc ) ways of choosing S and T. Thus 

Y*(2£) ^ J2 ^Ck)^^) 1 "*^*^- (8) 

Recall that P*(2£) = R*(2£) + F*(2£), and observe that 

\S{£)\ < i?*(2£) ^ ^2 2£ P*(2£-2), 

where 5(£) is the set in Theorem [3, by the union bound over the events M(j, k) A {\A\ = 
£+ 1}. Recall that we can calculate \S(£)\ exactly using (j3J). 
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Using these recursions, it is clear that we can obtain an upper bound on Y*(2£) for 
every £ G N. In order to prove the claimed bounds, we first calculate the base cases 
slightly more carefully To be precise, we shall use the following easy observations. 

Claim : P*(4) = R*(4) = 144 and F*(6) 6! ■ 2 4 . 

Proof of claim. The first part follows because |«S(2)| = 144 (by (J3J)), and if A internally 
spans Q = [2] 4 and \A\ = 3, then A sequentially spans Q. To bound Y*(6), we use (jSJ) 
and the following observation: the only pair (k,m) G J$ is (2,2), but now the formula 
above can be improved by a factor of two, since we are double counting (the cubes S and 
T are indistinguishable). Thus 

F*(6) < 2 5 QjQjP*(2) 2 < 6!-2 4 , 

as claimed. □ 

Now, using a simple computer program and the recurrence relations above, we obtain 
the following values for P*(2£), R*(2£) and Y*(2£). 



2i 


2 


4 


6 


8 


10 


12 


14 


\S{£)\ « 
P*(2£) ^ 
R*{2£) ^ 
Y*(2£) ^ 

9 £ Y*(2£) < 
\S(t)\ ^ 


2 
2 
2 




144 
144 
144 






116160 
149760 
138240 
11520 
0.794 


7 x 10 8 
1.2 x 10 9 
1.1 x 10 9 
3.1 x 10 7 
0.696 


3 x 10 13 
6 x 10 13 
6 x 10 13 

4 x 10 11 
0.440 


7 x 10 18 
2 x 10 19 

2 x 10 19 

3 x 10 16 
0.264 


8 x 10 24 
2 x 10 25 
2 x 10 25 
2 x 10 22 
0.155 



Continuing in the same way, we observe that 2 e Y*(2£) < (4/5)|cS(£)| for all 
Thus 

Y(2£,p) ^ Y*(2£) P e+1 + (/^ 2 )p' +2 < (j + 0(P)) ^(2*,p), 
so the lemma follows. 

Finally, we need to bound Y(2£ + l,p) for small £. 
Lemma 24. Let p > be sufficiently small, and let £ ^ 25. Then 

Y(2£+l,p) <: Ir(2£+1, P ). 
5 

Proof. The proof is very similar to that of Lemma [231 but we have to consider two ad- 
ditional cases: the case where k and m have the same parity, and the case where one of 
them is even and the other odd. Recall the definitions of Y*(£), P*(£) and R*(£) from 
above, and apply Lemma [51 to obtain disjointly internally spanned cubes S and T with 
[S U T] = [2] 2e+1 . Let k = dim(S') and m = dim(T), where k ^ m. 



< £ < 25. 



□ 
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We first claim that either k + m = 2£ — 1, or k + m = 2£ and k and m are both even. 
This follows by Lemma El since a witness set for S to be internally spanned has size at 
least k/2 + 1, and one for T has size at least m/2 + 1, and \A\ = £ + 2. We deal with 
these two cases separately. In the first case we get 

Y*(2£ + l) < Yl ^^k^^^L'^^^*^' (9) 



k+m = 2£-l 



and in the second case 



Y*(2£ + l) < Y 22 ,-i(2^ + l)!(fcm + 4) p 

< * h\ ml 



fc+m = 2£ 
fc,m even 



since we have 
o2m 



k J\ m J \ k J \1 J \ m — 1/ fc!m! 

ways of choosing 5 and T. By the comments above, y*(2£+ 1) = Y{(2£ + 1) + F 2 *( 2 ^ + !)• 
Next, recall that P*(2£ + 1) = R*(2£ + 1) + F*(2£ + 1), and observe that 



2 2e+1 \S(£)\ ^ R*(2£+l) ^ (2£+ 1)2 2£+1 P* (2£) + ^ + ^2 2m P* 



(2£-l). 



The lower bound follows by considering the event L(l); the upper bound follows by using 
the union bound over the events M(j, k) and L(i), as in Lemma O 

Using these recursions, we can obtain an upper bound on Y*(2£ + 1) for every I e N. 
We first calculate P*(3) exactly. 

Claim : P*(3) = P*(3) = 32. 



Proof of claim. We count the number of ways in which three sites can percolate. If two of 
the sites are in opposite corners, then the triple percolates: there are 24 such configura- 
tions. If not, then each pair of points is at distance two (take two points at distance one, 
and consider the position of the other). There are eight ways to choose the first point, 
three to choose the second, and two to choose the third, and we get each triple in 3! = 6 
ways; thus there are 8 configurations of this type. It is easy to see that P*(3) = P*(3). □ 



Now, using a simple computer program and the recurrence relations above, we obtain 
the following values for P*(2£ + 1), R*(2£ + 1) and Y*(2£ + 1). 
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9£ 4- 1 


1 
± 


Q 

o 


o 


7 
t 


Q 


1 1 


1 3 


F*(2£+ 1) ^ 
P*{2£ + 1) < 
i?*(2£+ 1) < 

r*(2£+ 1) ^ 

Y*(2t+l) ^ 
R*(2t+1) ^ 


1 
1 
1 





32 
32 
32 




4608 

33280 

33280 






1.4 x 10 7 
2 x 10 8 
2 x 10 8 
1720320 
0.116 


3.6 x 10 11 
9 x 10 12 
9 x 10 12 

6.3 x 10 10 
0.171 


5 x 10 16 
2 x 10 18 

2 x 10 18 

3 x 10 15 
0.047 


5 x 10 22 
3 x 10 24 
3 x 10 24 
7 x 10 20 
0.013 



Continuing in the same way, we observe that Y*(2£ + 1) < (9/50)i?*(2£ + 1) for all 
< £ < 25. Thus 

Y(2£ + l,p) < F*(2£+iy +2 + Q^)/ 4 " 3 < Q^ + O(p))j2(2*+l,j0, 

and so the lemma follows. □ 

We can now deduce our upper bounds on Y(£,p). 

Proof of Lemmas\2^ and EH Let p > be sufficiently small, and let leN, with £ 2 2 p ^ 
1. Suppose the properties (*) for 2£ — 1 hold. If £ ^ 25 then we have 

Y(2£,p) < ±- e R(2£,p), 

by Lemma [231 whereas if £ > 25, then 

2 10 f2^ 2 1 
Y(2£,p) <: -A^L R (2£ )P ) <: - £ R(2£,p), 

by Lemma [22} as required. Now suppose the properties (*) for 2£ hold. If £ ^ 25 then 

Y(2£ + l,p) ^ ~R(2£ + l,p), 

by Lemma [2H and if £ > 25 then 

2 20 f2/+ll 2 1 
Y(2£+l,p) ^ ^ J R(2£+l,p) ^ -R(2£ + l,p), 

by Lemma [221 as claimed. □ 
Finally we may put the pieces together, and prove Lemma [T71 



Proof of Lemma 17. Let 5 > be sufficiently small, and let p > and £ G N, with 
£ 2 2 2 V sC 5. By Lemma QSJ 

F(2^p) < R(2£,p) + X(2£,p) + Y(2£,p) + Z{2£,p). 

Suppose the properties (*) for 2£ — 1 hold. Then, by Lemmas [T9l and |20| we have 
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as required. Next, by Lemma [TH] we have 

P(2£ + l,p) < R(2£+l,p) + X(2£+l,p) + Y(2£ + l,p). 
Suppose the properties (*) for 2£ hold. Then, by Lemmas [T9l and |2~T1 we have 

P{2£+l,p) ^ fl + of^) +^\r(2£+1, p ) <; ^R(2£+l,p), 

since 5 > is chosen to be sufficiently small, as required. □ 

3.6. Proof of Theorem 1131 Theorem [TBI now follows by combining Lemmas [T4l and [XTj 

and using the technical lemma, Lemma HI 

Proof of Theorem Let C > be a constant to be chosen later, let 5 = 6(C) > be 
sufficiently small, and let p > 0. For each £ G No, define a function / by 

R(2£,p) = f(2£)z(2£)p i+ \l-pf t ^- 1 (10) 

and 

R(2£+l,p) = /(2£ + l)(2£+l)!A^2 (m) V +2 , (11) 
where z{2£) = (2£)\X~ e 2 e2 is the function defined in ([5]). Also, for each £ G No define 

P(£,p) = h(e)R(£,p). 

We claim that there exists a function g : No — > K + , with V^g(m) ^ -, such that the 

3 

m=0 

following four conditions hold for every £ G N with £ 2 2 2t p ^ 5: 



(a) f(2£) ^ ^exp U^aE 

\ m=0 

(b) f(2£ + l) <: 4 



e-i 

g(m) 



(c) 1 ^ h(2£) ^ I + gift). 

(d) 1 ^ h(2£ + l) <: -. 

In fact, we shall prove, by induction on £, that these conditions hold for the function 
0(0) = 0, g(l) = g{2) = CS, and 

. , C5 1 
aim) = — - + — 
yy ' m 2 2 m 

for every m ^ 3. 

The proof will be by induction on £, so we begin with the base cases. We first bound 
f(£) for £ ^ 5; to be precise we claim that 

/(0) = 1, /(l) = l/2, f(2)=X/2, /(3)<1, /(4)<A/2, /(5) < 1. 

The values for £ ^ 2 follow since _R(0,p) = p, R(l,p) = p 2 and R(2,p) = 2p 2 {l — p) 2 . 
For 3 ^ £ ^ 5, recall (from the proof of Lemmas [231 and [241) that R(3,p) = 32p 3 + 0(p 4 ), 
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R(A) = 144p 3 + 0{p A ), and R(5) < 33280p 4 + 0{p 5 ), and note that p ^ 5/£ 2 2 21 ^ 5 is 
sufficiently small. Thus 

and similarly /(4) < 4!A 1 ^° 2 4 < A/2, and /(5) < 5 f^°° 9 < 1. Next, we observe that 
h(0) = h(l) = 1, by the bounds above, and claim that 

1 < h(£) ^ 1 + CS 

for £ e {2,3,4,5}, as long as C is chosen to be sufficiently large. Indeed, recall that if 
£ < 5 then Y(£,p) = 0, and thus P(£,p) = R(£,p) + 0(p^ +2 ), by Lemma M But 
p ^ 5, and so the bounds on h(£) follow. 

So let £ ^ 3, and assume that the claimed upper bounds on f(t) and h(t) hold for every 
t ^2£ — l. It is important to note that this implies that the properties (*) hold for 2£ — 1. 
We shall prove the claimed bounds for t = 2£ and t = 2£ + 1; we begin by re- writing (the 
first part of) Lemma [H] in a more useful form. 

t 

Claim 1 : f{2£) = ^(-l) m+1 a m h{2£ - 2m) f{2£ - 2m). 

m=l 

Proof of claim. By Lemma [T4l since £ ^ 2, we have 

J(2£ - 2m) 



R(2£,p) = ^(-l) m+1 a m p m S 2i , m , p —^—P(2£-2m,p), 



and by the definitions above, 

P(2£) = h(2£)R(2£,p) = h(2£) f{2£) z(2£) p i+ \l - p) 2 ""'" 1 . 
The claim now follows with a little algebra. □ 
Now, recall that by the induction hypothesis, 

h{2l-2m) ^ 1 + g(£-m) 
for each m e [£}. Thus we may apply Lemma HJ to obtain 

6-1 



/(*) ^-p 



2 V 2 — A ; 

\ m=0 

as required. Also, since the properties (*) for 2£ — 1 hold, by Lemma [171 we have 

H2£) < l + ^ + i = 1 + <?W, 

since £ ^ 3, as long as we chose C to be sufficiently large. 

We have thus proved the claimed upper bounds on f(t) and h(t) for t = 2£. Observe 
that therefore the properties (*) hold for 2£. 

Next we write the other part of Lemma [TH in a more useful form. 
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Claim 2 : f(2£+l) < 1 + ^ h(2i - 1) f(2l - 1). 

Proof of claim. Recall that, since £ ^ 2, by Lemma IT41 we have 

R{2£+l,p) <: (2£+l)2 2£+1 pP(2£,p) + ( 2 ^ l \2 2t+l pP(2£-l,p). 



2 

We also have, by the properties (*) for 2£, that P(2£,p) sC (2£)! A - '2*y +1 , and that 

P(2£-l,p) = l*(2f-l)/(2f-l)(2f-l)!A- w 2V +1 . 
The result now follows with a little algebra. □ 

Now, by the induction hypothesis, h(2£ — 1) ^ 5/4 and f(2£ — 1) ^ 4. Hence, by 
Claim 2 it follows that 

f(2£+l) ^ 1 + ^(2*-l)/(2*-l) < 1 + | /(2^ - 1) < 4, 

since A < 6/5, as required. The bound /i(2£ + 1) ^ 5/4 follows by Lemma ITT1 since the 
properties (*) for 2£ hold. 

Hence we have proved the claimed upper bounds on f{t) and h(t) for t = 2£ + 1, and 
the induction step is complete. Taking 5 = 5(C) sufficiently small, we have 

y g (m) < 2C6 + y + — ) < - + 3C6 < -, 
y ^ ^ \ m 2 2 m / 4 3 

m=0 m=3 7 

and Theorem [TBI follows. □ 

Finally, we can deduce Theorem [31 

Proof of Theorem 0. The lower bound in part (a) follows from Theorem [7J and the lower 
bound in part (b) follows by Proposition (THl The upper bounds follow easily from Theo- 
rem [T31 as noted in Section 13.21 □ 

4. An upper bound for the critical probability 

In this section we shall deduce the upper bounds in Theorems [Hand [2] from Theorem [71 
We shall not need Theorem [31 in order to prove the upper bounds. 

Let n = n(d) be a function satisfying d ^> logn ^ 1 as d — > oo. We shall prove that if 

p = iX (»)'U 1 + 5Co^ + lllog n -ll\ ^, 
\n — 1/ d z \ y/d log n J 

and A ~ Bin([n] d ,p), then P(^4 percolates) — > 1 as d — > oo. Note that when n(d) = 2 for 
all d G N, this gives the desired bound on the hypercube, and when d ^> logn it gives the 
bound in Theorem [2l 

Let e = e(d) e R with |e(d)| = 0(1), let a = y/d log n — \_y/d \ogn\ e [0, 1), and set 

.)' = log !\( — ^) +.-. 



n — 1 
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Set £ = [y/d lognj + [log \fd\ , and note that £ -C d, since d ^> logn, and that 



d log n 



log Vd 



+ a 



Furthermore, set 

s = 2 v / rflogn + 21ogd-/3, (12) 
and let p = 2~ s . Note that d2 2l p = 6(1). Write X(2£,p) for the number of [2] 2£ -subcubes 
of [n] d which are sequentially internally spanned by a set A ~ Bin([n] d ,p). 

Lemma 25. Let n = n(d) satisfy d 3> logn ^ 1, and Zet £ G N and p > be as defined 
above. Then, if d is sufficiently large, 



2/' 



different [2] -subcubes in [n] , and each is sequen- 



/A j/n-1 
Proof. There are I In I 

tially internally spanned with probability Q(2£,p). Observe also that 

dlogn + f = (i- [logv^J +a) 2 + f ^ 2£ 2 - 2^[logVdJ + 2crf + ^(logd) J 
and that 

s(£ + l) = 



2^dlogn + 21ogd-/3J ^ + 1 
2£ - 2 [log >/dJ + 2a + 21ogd - ^ ^ + 1 
< 2£ 2 - 2£[log\/rfJ + 2^1ogd + l + 



« - § ) + - 



Note that 2 2£ p = o(l) as d — > oo. Thus, by Theorem [3, 



2(" 



-(2£)!A- £ 2V +1 
5 



2 /d-2A 2 Vn-T 24 



5 V Vx 

^ I d-2£ ^ ( n - 1 ^ 2 - 2 ^iogd-/3/2+i)^ 



n 



But 



and 



71—1 



4A 



72 — 1 



1 1/2 



2£ /2 



d-2l 
d 



2f 



exp 



10' 
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if d is sufficiently large, since £ 2 = (f + o(l))dlog 2 n 2cflnn, and £ <C d. Thus 

E(X(2£,p)) > ^, 

as required. □ 

We wish to deduce that, with high probability, some [2] 2 ^-subcube really is internally 
spanned; we shall do so by bounding the variance of X(2£,p). In |4J it was shown that 

Vai(X(2£,p)) = 2 o{l \ 

We shall need the following more precise bound on the variance. (Our bound is in fact 
close to best possible, see Remark [6] below.) Here we make crucial use of the fact that we 
are counting sequentially internally spanned subcubes. 

Lemma 26. Let p > and £ G N, with 2t < d and 2 2i p < 1. Then 

V&r(X(2£,p)) <: d 3logd E(X{2£,p)). 

We shall use the following lemma in the proof of Lemma 1261 
Lemma 27. Let leN and < m < £. Let B C [n} d with \B\ = £ - m + 1. Define 

S{B,£) := {A D B : |A| = £ + 1 and A sequentially spans a hypercube Q C [n] d }. 
Then 

m 

\S(B,£)\ <: rf 2m+2-Q 2 2^+3 

3=1 

Note that, if A 6 S(B,£), then the definition implies that the hypercube Q = [A] has 
dimension 2£. 

Proof. We cover the set S(B,£) with sets S q as follows. For each ^ q ^ £, let <S g denote 
the collection of sets A e £) such that there exists a spanning sequence (do, ■ ■ ■ , ai) 
for [A] (i.e., an ordering of A such that d(a,j + i, [{a , . . . , %•}]) = 2 for each ^ j ^ £ — 1), 
such that g = min{t : a f G -B}. 

Claim : |«S,| ^ d 2m+1 Ylf=i 2 2i ~ 2 ^ 3 for each ^ q ^ £. 

Proof of Claim. Let C = A \ B, and note that \C\ = m. We are required to count the 
possible sets C such that B U C 

First we choose the element a q G B (at most £ + 1 choices). Next, if q ^ 1, we choose 
the 2 g- dimensional hypercube R = [{a , . . . , a g }]. Note that a g G R, so there are at most 

2 2 4 d ' 
\2q 

choices for R. Moreover, given R, there are at most (2q)\2 q2 choices for the set C R R, by 
Theorem [71 since (C D R) U {a g } must sequentially span R. If g = then let R = {ao}. 

Now let Ri denote the largest hypercube in [R U B] such that R C R%, and let 2r\ = 
dim(i?i). We claim that there exists a vertex c\ G C with d(c\,R\) = 2; indeed, the 
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minimal element c of C \ {ao, . . . , Q> q -i} (in the order a < ■ ■ ■ < at) is at distance 
exactly two from Ri. This follows because (a , . . . , at) is a spanning sequence for [A] (so 
d(c, Ri) ^ 2), and if d(c,Ri) < 2 then the dimension of [Ri U {c}] would be too small, 
and A would not span a hypercube of dimension 2£. 
There are at most 

choices for ci, since c\ e R 1 ± ej ± for some pair of directions {i, j}. Now iterate this 
process: to be precise, for each 2 ^ j ^ m — q, let denote the largest hypercube in 
[Rj-iUB] with R C -Rj, let 2r,- = dim(Rj), and choose a vertex Cj G C with d(cj, Rj) = 2. 
There are at most (f\2 2rj+2 choices for cj. 

Finally, putting the pieces together, we have at most 

(I + 1)2^ (^J (2q)\ r 2 max JJ Q ^ 

choices for C, where the maximum is taken over all sequences r = (r 1; . . . ,r m _ q ) with 
q ^ r\ < . . . < r m _ q < t. Note we may assume that 2£ ^ d, since otherwise \S(B, £)\ =0. 
Hence 

m—q m 

\S q \ ^ d 2m+l 2 q2+2(1 Y[ 2 2e ~ 2j+2 <: d 2m+1 Y[2 2e - 2l+s , 

3=1 3=1 
m 

as claimed, since q ^ m ^ £, and so (2£ — 2j + 2) ^ g(g + 1). □ 

j=m— q+1 

The lemma follows immediately from the claim, since S(B,£) C IJq^g- ^ 

Remark 5. In the proof above we used (and proved) the following nice fact about 
spanning sequences. Suppose that (a , . . . , ai) is a spanning sequence for Q, and let 
{b , . . . , b t } C {ao, • • • ) a e} with b = a and d(bj + i, [{b , . . . , bj}}) = 2 for each j ^ t — 1. 
Then (6 , ■ ■ ■ ,b t ) can be extended to a spanning sequence of Q that is a permutation of 
(ao, • • • , at)- 

We can now prove our claimed upper bound on the variance of X(2£,p). 

Proof of Lemma [2UI We are required to bound K(X(2£,p) 2 ^) , the expected number of 
pairs (Q,Q'), where Q and Q' are 2£-dimensional subcubes of [n] d , such that both Q 
and Q' are sequentially internally spanned by A. We do so by first bounding, for each 
^ m ^ £+ 1, the expected number of pairs for which \A D Q D Q'| = £+1 — m, and then 
summing over m. Recall that, since Q and Q' are sequentially spanned by A, it follows 
that \AC\Q\ = \AnQ'\= e+i. 

First note that if m = £ + 1, then the sets y4flQ and AnQ' are disjoint, and so, by 
the van den Berg-Kesten Lemma, the expected number of pairs is at most E(X(2£,p)) . 
So fix ^ m ^ £, fix the cube Q C [n] d , and recall that the number of ways of choosing 
a set A n Q which sequentially internally spans Q is denoted |<S(£)|. 
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Now, given the set AnQ which sequentially spans Q, and assuming that |y4nQnQ'| = 
({+ 1 — m, we have ( +1 ) ^ d m ways of choosing the set B = AdQdQ' . By Lemma 1271 we 
then have at most d 2m+2 YljLi 2 2 ^ _2 - ?+3 choices for the set A D Q', given B. Hence, given 
Q and m, the number of choices for the pair (A(lQ,An Q') is at most 

m 

\S{£)\d 3m+2 l[2 2e ' 2 ^ 3 . 

i=i 

Thus, 



E(X(2£,p) 2 ) < E(X(2£,p)) 2 + ^ El 5 

m=0 Q:dim(Q)=2e 



3m+2 



b=i 



l+m+l 



and hence, using the bounds 2 2£ p ^ 1 and 21 ^ d 



E(X(2£ } p) 2 ) - E(X(2£,p)) 2 < ^ E(^(2l,p)) (2d) 3m+2 JJ 2~ 2j ' 

m=0 j'=l 

*C d 31ogd E(;f(2£,p)), 

as required. The final inequality follows because the maximum in the sum occurs when 
m«3(logd)/2. □ 

Remark 6. If d 2 ~ c 2 2£ p ^ 1 for some c > 0, and log<i <C £ <C d, then we also have 

Vax(AT(2£,p)) ^ d 51ogd E(#(2^p)) 

for some 5 = 5(c) > 0, so the upper bound in Lemma [26] is close to best possible. To 
see this, consider only the cases m = £ + 1 and m = (c/2) logrf, and assume that the set 
B = AdQdQ' consists of the first £ + 1 — m elements of the spanning sequences of both 
Q and Q' . We obtain (approximately, since we did not prove a lower bound in the case 
m = £ + l), 



d-2£ 
2 



2 2e-2j 



V 



E(X(2£,p) 2 ) - E(X(2£,p)) 2 > E(X(2£,p)) 1- f[ 

m - j= i 

/ Jc \m m 

^ E(X(2£,p)) I 1 J\2- 2 3 > d sl ° sd E(X(2£,p)), 

\ 111 / j=1 

for any 5 < c 2 /4, as claimed. 

Recall that d2 2i p « 1 for £ « y/ d log n + log \fd and p = 2~ s , as defined above, so this 
lower bound holds in the case in which we are interested. 



We can now deduce that X(2£,p) is large with high probability by the standard second 
moment method (see [12], for example). 
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Lemma 28 (Chebyshev's inequality). Let X be a random variable, and lett > 0. Then 

Var(X) 



F(\X-E(X)\ >t) < 
P(X = 0) < 



t 2 ' 

and Zience 

Var(X) 
E(X) 2 ' 

The following result, which follows immediately from Lemmas |25j [26] and [281 sum- 
marises what we have proved so far. 

Corollary 29. Let n = n(d) satisfy d ^> \ogn ^ 1, let £ = [y/d\ogn\ + [log yfd\ , and let 
p = 2~ s > be as defined in (fl2l . Suppose that e£ — 3(logd) 2 — 10 \ogn — >• oo as d — )■ oo. 
T/ten, 

P(#(2£,p) = 0) ->■ 

as d —7- oo. 

Proof. By Lemmas [281 126] and [25] respectively, we have 

p(*<*,p) = o) S ^Li <m < - 

as d — > oo, since e£ — 3(logd) 2 — 10 log n — >■ oo. □ 

Corollary [29] tells us that, with high probability, there exists an internally spanned 
[2] 2 ^-subcube in [n] d . All that remains is to show that such a subcube has probability 
1 — o(l) of infecting the entire vertex set. 

Indeed, we want to grow one of our cubes, Q, step by step - first to a [2] d / 2 -cube, and 
then to an [n] d -cube - and show that at each step the probability that we get stuck is 
small. There is a problem however, because the steps are not independent of each other; 
in fact we are (essentially) summing over a large number of overlapping paths. We solve 
this problem (as in [4]) using 'sprinkling', i.e., by dividing up p into pieces, and using a 
different set of sites for each step. 

To be precise, we shall set 

* _ 1 -2Vdlogn 

and for each j e N, let pj = 2~ip* and let Aj ~ Bin([n] d ,pj) be independently chosen sets 
of infected vertices. Observe that 

2 2 V > 1 

where £ = [y/ d log n\ + [log \fd\ is the function defined above. 

Lemma 30. Let d,£ G N and p* > satisfy 2 2e p* ^ l/(16d 3/2 ), and let Q C [2} d be a 
hypercube with dim(Q) = 21. Let A' = Q U A, where A ~ Bin([2] d ,p*). Then, 



F P *(SR C [A'] : dim(ii!) ^ d/2) ^ 1 - 2exp \-Ly/dj _». 1 



as d —7- oo. 
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Proof. For each j e N, set k = 2£ + 2j-2 and pj = 2~ j p*. Let F Pj (2 k -fr 2 k+2 ) denote the 
probability that, given a [2] fc -cube Q' C [2} d and a random set Aj ~ Bin([2] d , pj), there is 
no active site v G Aj satisfying d(v, Q') = 2. Note that if d(v, Q') = 2 then \Q' U {v}] is a 
[2] fe+2 -cube. 

Claim : If k = 2£ + 2j - 2 ^ d/2, then P Pj (2 fe 2 fc+2 ) ^ exp (^-2^ 10 v / rf). 

Proq/ o/ claim. Let be any k- dimensional subcube of [2} d . There are at least (^ 2 ) ^ 
d 2 /9 disjoint [2] fc -cubes Qi, . . . , Q t in [2] d , such that each vertex v e (Jj <5? is at distance 
exactly 2 from Q'. Thus 

P p ,(2 fe ^2 fe + 2 ) < (1- Pj ) m2fc < exp (- (2-V) y2 2 ^- 2 ) 

< exp (^-2- ? '- 10 v / rf) , 

as claimed, since 16d 3 ^ 2 2 2e p* > 1 □ 
Hence, since UjAj is stochastically dominated by A ~ Bin([2] d ,p*), we have 

d/2 

P p * (3RC [A'] : dim(iZ) ^ d/2) > 1 - ^P Pj . (2 2e+2 ^ 2 y=> 2 21 ^) 

3=1 

oo / 1 \ 

^ 1 - ^ exp (-2^°^ ^ 1 - 2exp(- — v^j ->• 1, 

as d — > oo, as required. □ 

Finally we show that each [2] d / 2 -cube almost surely grows to cover the whole of [n] d . 

Lemma 31. Let n = n(d) € N satisfy d ^> loglogn + 1, and let p* = p*(d) > satisfy 
p* ^ 2~ d/3+o(d) as d ^ oo. Let R C [n] d fee a hypercube with dim(i?) = |_^/2J, arid Ze£ 
A' = i? U A, where A ~ Bm([n] d ,p*). Then 

F P .([A'] = [<) > 1- (2n) d exp (-2^+°^) -> 1 

as d oo. 

Proof. If A' does not percolate, then there must exist a [d/2 J -dimensional hypercube 
5 C [n] d with A fl S = 0. Let Q denote the collection of such hypercubes, and note that 

P p .(SeQ) ^ (1-p*)^ ^ exp (-p*2 Ld/2j ) <: exp ( - 2 d/6+o(d) ) 

for any S. There are at most {2n) d hypercubes S with dim(S') = [_d/2\ in [n] d , and thus 

F r ([A'}^[n] d ^ ^ E(\Q\) ^ (2n) d exp(-2 d / 6+ °W), 

as required. □ 
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The upper bounds in Theorems [T] and [2] now follow easily. 
Theorem 32. Let n = n(d) be a function such that d S> logn ^ 1. Let 

„ = 4A ( - ArV 1 fl + 5(log^ + lUogn-ll\ 2 - 2 ^ 
\n - 1/ d 2 \ Vdlogn / 

and let A ~ Bin([n] d ,»). TTien 

P P ([A] = [n] d ) 1 

as d — » oo. 
Proof. Let 

e = e(d) := log 



1 + 



5(logd) 2 + 11 logn — A/log n — 11 



and let /3 = log 



P 



4A 



4A 



n 



n — 1 

2 1 



\/ 3 log n 

+ £. Set s = 2 a/ 3 logn + 2 logd — /3, and note that 
5(logd) 2 + 11 logn - 11' 



2 -2Vdiogn ^ 2~ s + 2p*, 



\/d log n 

where p* = d" 5 / 2 2" 2 v /a!lo s™, as before. Set p x = 2~ s , and let A* = A x U A 2 U A 3 , where 
A\ ~ Bin([ra] d ,pi) and A 2 ,A^ ~ Bin([n] d , »*). It is easy to see that A* is stochastically 
dominated by A ~ Bin([n] d ,p). 

Let £ = [\/3 log nj + [logv^J, and let E(l) denote the event that some hypercube 
Q C [n} d with dim(Q) = 2£ is sequentially internally spanned by A±. Let E(2) denote 
the event that some hypercube R C [n] d with dim(i?) = |_d/2j is internally spanned by 
A\ U A 2 , and let E(3) denote the event that A* percolates. 

Observe that, since (logd) 2 + logn <C y/d logn, we have (for large d) 

'4(logd) 2 + 10 logn 



et > 



^ 4(logd) 2 + 10 logn, 



\fd log n 

and so we may apply Corollary [291 Hence, by Corollary 
we have 

P p (L4] = [n] d ) > P(#(l)) -P (^(2)|S(1)) -P (#(3) | £(2) 
as d — > oo, and the theorem follows. 

5. A LOWER BOUND ON THE CRITICAL PROBABILITY 



Lemma EDI and Lemma 



-> 1 



□ 



In this section we shall deduce the lower bounds in Theorems [T] and [2] from Theorem [3j 
To be precise, we shall show that, given any function n = n(d) with d ^> logn ^ 1, if 

p = 4A|-^Yjfl + lo g^- 161 °S^+^^ 2 v^ 



d 2 



a/ d log n 
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and A ~ Bin([n] d ,p), then ¥(A percolates) — > as d — > oo. Note that when n(d) = 2 for 
all d G N, this gives the desired bound on the hypercube, and when d 3> logn it gives the 
bound in Theorem |2j 

We shall use the basic method of jl] , but we will need to do some more careful counting, 
and also to bound the probability that an arbitrary cube (of sufficiently low dimension) 
is internally spanned. Indeed, let n = n(d) satisfy d logn ^ 1, let e = e(d) G M with 
e(d) < 1, and set 

/ \ 2" 

4A 



j 9 = log 



+ 2e. 



Set £ = 2 (a/ d log n — a) G N, where a > is a constant to be chosen later, and note that 
f C (J, since d S> logn, and that dlogn = (£ + 2a) 2 /4. Furthermore, set 



s 



2y/d\ogn + 2logd - f3, (13) 

and let p = 2 _s . Throughout, we assume that A ~ Bin([n] d ,p). Note that, if d is 
sufficiently large, then 

< 2^ 
a 

where 5 > is the constant in Theorem [31 since d ^> logn. 

We shall count the expected number of pairs (S,T), such that S, T C are cubes 
satisfying the following conditions: 

(a) S and T are disjointly internally spanned by A, 

(b) dim(T) ^ dim(S) < £, 

(c) [/ = [S 1 U T] is a cube, and dim({7) ^ £. 

Definition. For each i G N, let denote the collection of pairs {S,T) in [n] d which 
satisfy conditions (b) and (c) above. Let C denote the number of pairs in Ui which 
also satisfy condition (a). We shall write Ug = \Ug\ and Wt = \Wt\. 

The idea of the proof comes from the following slight generalization of LemmaEl versions 
of which appear in both [1] and [26]. For completeness we give the proof. 

Lemma 33. If A C V — \n] d percolates, and 1 $C t ^ dim(K), then Wg^\. 

Proof. Consider the following 'cubes process'. We start with a set of m := \A\ cubes 
Co = {Ci, . . . , C m } of dimension zero (i.e., single points), and at each step we choose two 
of the cubes Ci,Cj G C t , such that d(Ci,Cj) ^ 2, and form Ct+i by replacing them with 
the cube [Ci U Cj]. Thus, for each t G N, 



Ct+i := (CtMC^C^UiidUCj]}. 



Since A percolates, we may continue this process until we are left with the single cube, 
[n] d . Observe that, at each stage of this process, the cubes in Ct are all disjointly internally 
spanned by A. 
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Now, simply consider the first time t at which there is a cube U aC t with dim({7) ^ £. 
Then in C t -i there must exist cubes S and T satisfying conditions (a)-(c) above. Hence 
W 7 ^ ^ 1, as required. □ 

We shall show that if e G K is sufficiently small and f 6N and p > are as defined 
above, then E P (IV^) <C 1. By Lemma [331 h will follow that P P (A percolates) — > as 
d — > oo. 

In order to bound Ep(W^), we shall need to bound from above the probability that an 
arbitrary cube Q is internally spanned, given dim(Q). The following lemma, combined 
with Theorem [31 allows us to do this. Let P(Q,p) denote the probability that [A] = Q, 
given A ~ Bin(Q,p), for an arbitrary cube Q. 

Lemma 34. Let n,d,£ G N and p > 0. Suppose Q C [n] d is a cube with dim(Q) = L 
Then, 

P(Q,p) < P(£,p). 

Proof. We describe a very simple coupling, from which the result follows immediately. 
Indeed, let a 1; . . . , G N, and assume that a\ ^ 3. Let S C [n] d+1 be an [a x ] x . . . x 
[ad] x [l]-cube, and let T C [n] d+1 be an [ai — 1] x [a 2 ] x . . . x [aj x [2]-cube. Note that 
dim(S') = dim(T). We claim that 

P(S,p) ^ P(T,p), 

and moreover that there is an injection : S — > T such that if A percolates in S then 
4>{A) percolates in T. 

Indeed, let 0' denote the natural isomorphism (as graphs) between S and the subgraph 
T' of T induced by the vertex set 

{x G T : Xd+i = 1} U {xGT : X\ = ai — 1 and x^+i = 2}, 

and let be the injection (of sets) induced by 0'. Thus, given A C S 1 , we define 0(A) C T 
as follows: 

(a) If x G T with x^+i = 1, then x G <fi{A) <^> x G A, 
(6) (xi, . . . , Xd, 2) G 4>(A) xi = ai — 1 and (a±, X2 ■ ■ ■ , Xd, 1) G A. 
Since the graphs S and T' are isomorphic, it follows immediately that if [A] = V(S), then 
V(V) C [<p{A)]. But [V(T')] = V(T), so 0(A) percolates in T, as required. □ 

Remark 7. Note that the proof of Lemma [341 does not work on the torus ZJ*. 

Now, given fc,m G N, with k ^ m, let Ue(k,m) C denote the set of pairs (S,T) G 
with dim(S') = and dim(T) = m, and let We(/c, m) = Ut(k,m) D Wg. We write 
Ut(k,m) = \Ui(k,m)\ and Wg(k,m) = \Wi(k,m)\. Thus 

W/ = X W^,m). 

fc,mSN 

Given n, d and £, as defined above, we shall bound E p (W^(A;,m)) for each k,m G N. 

The following simple observation will help us bound the number of pairs (S, T) of 
subcubes of [n] d , with dim(S) = k, dim(T) = m and d(S,T) ^ 2. 
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Observation 35. Let n,d G N and a%, . . . , aa G [n], let k = Y2i=i( a i ~ 1)> an d suppose 
that k ^ d. Then 

k 



JJ a .( n _ a . + l) ^ 2 k n d {^-^\ 
i=i ^ ' 



Proof. The result clearly holds when a« G {1,2} for each i. Suppose that ai ^ 3; since 
k ^ d, it follows that a« = 1 for some z, so let a2 = 1. Let a[ = ax — 1, a' 2 = 2 and = a« 
for i ^ 3. Then 

]^[a-(n - a- + 1) ^ J^J aj(n - a« + 1) 

i i 

if and only if 

2(ai — l)(n — l)(n — ai + 2) ^ ain(n — ai + 1). 
But this holds since a\ ^ 3, so 2(aj — 1) ^ a 1; and (n — l)(n — % +2) ^ n(n — ai + 1). □ 

We can now bound Ue(k,m), i.e., the number of pairs (S,T), for which the event 
(S,T) G Wf(A;,m) has positive probability. 



Lemma 36. Let n, d, £,k } m G N, itfit/i m ^ k < £ ^ 2^d \ogn < d/A. Then 

Proof. Recall that Ue(k,m) denotes the number of pairs (S,T) such that S, T C [n] d 
are cubes, with dim(S') = k and dim(T) = m, such that [S 1 U T] is a subcube with 
dim([SUT]) > £. 

There are at most ( < ( d £ fc ) wa Y s to choose the dimensions of S, since there are 

at most this many ways of choosing a sequence of d non-negative integers summing to k. 
Similarly there are at most ( d+m \ ways to choose the dimensions of T. Observe that 

k J \d — kj \k J \d — kj \k J \k 

since k < t ^ 2 v /dIog^n, and k < d/4. Similarly ( d + m ) n 8 (^). 

Now, if 5 is an [ax] x . . . x [a rf ]-cube, then there are at most Yli( n — a « + 1) ways to 
choose its bottom-left corner, and given S, there are at most 



(2d) m+2 \S\ = (2d) m+2 Y[, 



ways to choose the bottom-left corner of T. This follows because [S U T] is a cube, so the 
bottom-left corner of T must be within distance m + 2 of some point of S. 
Thus, by Observation 1351 there are at most 



g) C) m-« n «.(» -«.+d< 2— d - g) g) „«• (^i) 1 



ways of choosing S and T, and so the lemma follows. □ 
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Using Theorem [31 Lemmas [3H and [3S1 and the van den Berg-Kesten Lemma, we can 
now bound K p (We(k, m)) for all k,m EN. 

Lemma 37. Let n = n(d) G N satisfy d ^> logn, and let a > be a sufficiently large 
constant. Let £ = 2^d logn — a), and let p = 2~ s > be as defined in (!T3|) . for some 
e(d) < 1. Let k,m GN, with m ^ k < £ and k + m + 2 ^ £. Then, 

E p (W e (k,m)) < n i6 2 ^-2iogd+o ( i) 

Moreover, if m ^ 6 t/ien 

Ep(Wi(A;,m)) < 2~ £ . 

Proof. Recall that if {S,T) e W^(A;,m) then 5* and T are disjointly internally spanned. 
Thus, by the van den Berg-Kesten Lemma, and Lemmas [3H and [361 

E p (W e (k,m)) < £ P(S,p)P(T,p) 

(S,T)eUi(k,m) 

Now let 5 > be the constant in Theorem [31 and recall that £ 2 2 e p ^ 5 if d is sufficiently 
large, and that k,m < £. Thus, by Theorem [31 

'f)( d ) p (k, P) P(m, p) < T7—;(k\ X^/^^p^' 2 ] (ml \- m / 2 2 m2/4 p( m+2 ^ 2 \ . 
k J \m J k\m\\ / V / 

(Here we used the fact that 2 fc2/4 p (fc+2)/2 > 2 (fc+1)2/4 p (A:+3)/2 , and similarly for m. This 
follows because k,m < £, and 2 e p — >• as d — > oo, see also (jl]).) Thus 

Ep(Wi(fc,m)) < 2 fc+m+2 d fc+2m+2 r2 d+16 (V (fc+m)/2 2 ( fc2 +™ 2 )/y fc + m + 4 )/ 2 

' (14) 

The rest of the proof is now a straightforward calculation. Indeed, we claim that, given 
k ^ m with + m + 2 ^ the right-hand side of (THj) is maximized when k = £ — 2 and 
m = 0. To see this, observe first that the function 

f[k m) ■= 2 k+m+2 d h+2m+2 n d+w (——\ ' A~ (fc+m)/2 2 (fc2+m2)/4 p (fc+m+4)/2 



is decreasing in k. This follows because 



since rf 2 2^ < 2?- 2a , and a > was chosen to be sufficiently large. Thus f(k,m) is 
maximized with k = £ — m — 2. 

Now suppose that k = £ — m — 2 and m > 0. Then either < m < £/A, in which case 

/(A; + l,m-l)V > 1 ( n - l \ ofc-m+i > x 



f(k,m) J d 2 \ n 
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since k — m > £/2 - 2, or m ^ £/4, in which case 



f(k + £/A,m-£/A) ^ (l_V /4 2 eys2 ^ L 



f(k,m) \2dJ 
So f(k, m) is maximized when k = I — 2 and m = 0, as claimed, and hence 



I \ 1-2 

n — 1 



E p (W^,m)) < 2 Vn d+16 ^^-^J x -t/2+x 2 (i-2? /A p (i+2)/2 ^ 
Now, recall that 

(£ + 2a) 2 Z 2 
dlogn = ± = — + a£ + 0(1) 

and note that 

s(£ + 2) = h y/d log n + 2 log d - /?) ^ + 2) = ^ + 2a + 21ogd - p\ U + 2 
= f + £(2\ogd + 2 + 2a- + 4\ogd + 0(1), 

dlogn + ^ - S ^J"^ = -£(logd+ 1-/3/2) - 21ogd + 0(1). 



so 



Hence, 



, \ ^-2 
n — 1 



Ep(Wi(Jfe,m)) < 2*ciV +16 - — - A -^i 2 (^) a /4 p (/+2)/2 



n 

SAn 16 / ^- 1 ) Y n d 2 ^/4-^ + 2)/2 



7? 



But 



<i(n — 1 
fn-l\ 1 / / // 



< n i6 / ( H' t 1 J \ 2 -^( 1 °g d + 1 -' 3 / 2 )^ 21o g d + ( 1 ) 



4A 2 £ = 2 e . 



Thus 



n J V\ \ n J 2aA V V 72-1 
E p (W^,m)) < n ie 2^-2 log d + o(i) ; 

as required. 

To show that if m ^ 6 then E p (W^(fc, m)) <C 2~ e , note that, by the argument above, if 
m ^ 6 then /(&, m) is maximized when k = £ — 8 and m = 6. Thus 

E p (W e (k,m)) <: 2 £+ °( 1 )^ +6 n d+16 f^yA-^ 2 2^/ 4 - 4 V' +2)/2 < dV 6 



n 

by the same calculation as before. Since £ 3> log d+ log n and e < 1, the result follows. □ 
The lower bound in Theorems [1] and [2] now follow easily. 



BOOTSTRAP PERCOLATION IN HIGH DIMENSIONS 



41 



Theorem 38. Let n = n(d) be a function such that d S> logn ^ 1. Let 



p = 4A ( _JL_ ) 1 f i + log^~ 16 log 7i +16 ^ 2 _ V31 5gn 



n — 1 J d 2 



a/ 3 log n 



and /et A ~ Bin ([n] d ,p). T/ien 

as d — )■ oo. 
Proof. Let 



p p ([A] = M d )^o 



and let 



e = £ (d) := - log 



(3 = log 



1 + 



4A 



logd — 16 logn + 16 
a/ d log n 



< 1, 



n 



n — 1 



+ 2e. 



Set s = 2 v / d logn + 2 logd — /3, and note that p = 2~ s . Let a > be a sufficiently large 
constant, and set £ = 2(i/dlogn — a). 
By Lemma l33l we have 

F p ([A} = [n} d ) ^ ¥ p (W e ^l) ^ E p (W e ), 

and by Lemma [37J if d is sufficiently large, 

since 1 ^ £ — ^ m + 2, so we have a bounded number of choices for k and m with m ^ 5. 
But loerfl + z) € z, so 



e£ — 2 log d + 16 log n ^ ( log d — 16 log n + 16 J — 2 log d + 16 log n = — log d + 16. 
Thus, 

P p (A percolates) < (ji) + { 2dlo & n ) 2 ~ £ ~> 



as d — > oo, as required. 



□ 



6. Open problems 

The work in this paper is only one step towards the main aim of this line of research: 
to determine p c ([n] d ,r) for arbitrary functions n = n(t), d = d(t) and r = r(t), with 
n + d— 7-ooast— t-oo. In this section we shall describe some of the most natural next 
steps which one could take, and state some of the most attractive open problems. 
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In Theorem[2]we determined the sharp threshold for p c ([n] d , 2) for all functions n = n(d) 
with d ^> logn. On the other hand, Balogh, Bollobas, Duminil-Copin and Morris [8] 
determined a sharp threshold when d is constant. To be precise, they proved that 



Pc([n] d ,2) 



A(d,2) + o(l) N ''- ] 
logn 



for an explicit constant X(d, 2) > satisfying X(d, 2) = + o(l) as d — > oo. (In fact 
they proved a more general result, determining a sharp threshold for all 2 ^ r ^ d.) 

Problem 1. Determine p c ([n] d , 2) for 1 ^ d = O(logn). 

When d ^ logn the critical droplet is no longer a hypercube, since [2] d is not likely to 
grow. Thus the first step in solving Problem [TJ is likely to be the following problem. 
Let P(k,£,p) denote the probability that A ~ Bin([fc] £ ,») percolates. 

Problem 2. Determine P(k,£,p) when k e p is sufficiently small. 

An even more important extension of Theorems [T] and |2] would be to study the r- 
neighbour process, for a general (but fixed) value of r. The problem seems to become 
much harder when r ^ 3; in fact, we are not yet able to prove even the following fairly 
weak conjecture, even in the case r = 3. 

Conjecture 1. For r fixed, 

p c ([2} d ,r) = expf^-efV/ 2 ^ 1 )). 

A related question, which is often useful in studying the critical threshold, asks for the 
extremal percolating sets. To be precise, given a graph G and an integer r, let 

m(G,r) := min{|v4| : A percolates in r-neighbour bootstrap on G}. 

Note that, by Lemma El we have 

~d{n- I) 



m([n] d ,2) 



+ 1, 



since it is straightforward to construct a percolating set with this many elements. When 
d and r are fixed, Pete [31 J gave various bounds on m([n] d ,r); in particular, he showed 
that m([n] d ,d) = n d ~ l . 

For the r-neighbour process on the hypercube, with r fixed, we have the following 
construction. 

Proposition 39. Let r E N. Then 

, r . J > l + O(l) / (l 

as d — > oo. 



BOOTSTRAP PERCOLATION IN HIGH DIMENSIONS 



43 



Proof. Let d ^ 2r, and let Ai be a system of r-subsets of [d] such that every (r — 1)- 
set is contained in at least one set of A\. By a theorem of Rodl [32], which resolved a 
long-standing conjecture of Erdos and Hanani [19], there exists such a set A\ with 

1 + 0(1)/ d 

T — 1 



as ti — >■ oo. Let A2 denote all the (r — 2)-subsets of [d\. We claim that A := A\ U 
percolates in r-neighbour bootstrap percolation on [2] d . To see this, simply note that 
every (r — l)-subset of [d] has r — 1 neighbours in A2, and at least one neighbour in A±. 
Hence all the (r — l)-sets are in [A], and the (r — l)-sets percolate in [2] d , since d ^ 2r. □ 

We conjecture that the construction in Proposition [591 is close to being extremal. 

Conjecture 2. Lei r G N. TTien 

m([2] d ,r) = e^- 1 ). 

It might even be true that Proposition [39] is essentially sharp, i.e., that 

m([2] d ,r) = (l + o(l))^. 

Since m([2] d , 2) = \d/2] + 1, it follows easily that m([2} d , r) ^ 2 r ^(d - r + 4), since any 
set which r-percolates in Q = [2} d must (r — 1) -percolate in each half of Q separately. 
Amazingly, however, we know of no non-trivial lower bound, even for m([2] d ,3). 

Returning to the case r = 2, we should also like to know the value of p c (Z d ,2), the 
critical probability for percolation on the d- dimensional torus, when d ^> logn. As we 
noted in Remark [U when n ^ 4 this would follow by the proof of Theorem El together 
with a modified version of Lemma [3H However, when n = 3 the critical probability will 
be completely different, since we can span Zg £ with i + 1 infected sites, and there are at 
least 3 e such minimal spanning sets. Nevertheless, it is likely that many of the ideas and 
techniques in this paper will be useful in attacking the following problem. 

Problem 3. Determine p c (%i, 2) . 

Finally, on the hypercube with r = 2, we ask for an even sharper result than Theorem [0 

Problem 4. Determine a G [1,2], if it exists, such that, for any e > 0, 

f(l + tt )2 ^ SPc([2] , 2)<f(l + « )2 - 2 . 

if d is sufficiently large. 

7. Appendix A: Proof of the technical lemma 

In this section we shall prove Lemma H] for an arbitrary sequence (ai, 02, ■ ■ ■) of positive 
real numbers, and an arbitrary A, satisfying the following conditions: 

(a) 1< A < 2, (b) a x = A, (c) V(-l) m+1 a m = 1, (d) > T for every m G N. 

' a-m+i A 

m=l 
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Recall that 



(2A) m 
2 m2 m! 



for each m G N, and observe that these conditions hold for the sequence (a m ). 

The proof of Lemma H] is elementary, and based on a series of straightforward induction 
arguments. We shall use the following simple properties of a m , which follow from the four 
conditions stated above. 

Lemma 40. Let a m be as described above. Then 
, « a m 2 2m (m + l) 6 



A 



A ' 



(b) a m > Oj for every m G N, 



i=m+l 



2m 



2m- 1 



VV-l) i+1 ai < 1 < ^2 (-l) i+1 ai for every m G N, 



8=1 



8=1 



(d) -Ctm+1 < 



.1=1 



< a 



m+l ; 



D->: 



i+1, 



,i=l 



A 



'm+l 



£(-i) m * - 1 



i=l 



Proof. Part (a) is trivial, and part (6) follows easily from (a), since 6/ A > 2. For (c), 

recall that Y^m=i(~ ^) m+l<1 m = 1, and suppose that ^i=i( — l) I+la i ^ 1 f° r some m G N. 
Then by (6), 

t oo 

^2{-iy +1 ai ^ 1 + a 2m+1 - E Oi > 1 

8=1 i=2m+2 

for every t ^ 2m + 1, which is a contradiction. The proof of the other bound is identical. 
Now, (c) implies immediately that 



Bm+l — dm+2 < 



£(-i) m a* - 1 



,i=l 



and by (a) we have 2a m+2 < a m+ i, so (d) follows. Finally, by (a) and (d), 

'm+l 



»+i. 



1 



1 3 3 

^ 2 Ctm + 1 ^ ^ a m+2 ^ 



^(-1)^ -1 



i=l 



and so (e) holds. 

We shall also use the following lemma. 



□ 
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Lemma 41. Let 2 ^ t e N and 1 < A < 2, and let g{m) ^ for each 1 ^ m < 
Suppose that /(l) ^ A/2, and tnat /(£) satisfies the recursion 



A 



t-i 



/(*) < 1 - « + (A -1)(1 + !))/(*-!) + ^/(m)y(m), 



m=l 



for every 2 ^ i ^ t. Then 



A / 1 £_1 



m=l 



Proof. For each £ e N, let 



m=l 



We shall prove by induction on £ that /(£) ^ H(£). The case £ = 1 follows since /(l) ^ 
A/2. Note also that 

(l + #(m))#(ra) ^ e 9(m) H(m) <: H(m + 1), 

since ^ 1 and <?(m) ^ 0. 

2 — A 

Let us think of g{m) and H{m) as step functions defined on R + , i.e., define = 

g(\_z\), so that / g(z) dz = g(m), and similarly for H(m), so 
•> l rn=l 

H(z) = bexp\cj^ g(y)dyj ^ bexp^cj^ g(y) dy^j , 



where 6 = — and c = — — -. We shall use the fact that 
2 2- A 



^g{m)H{m) ^ bj g(z)exp\cj g(y)dyjdz 

b ( f z \ ~ ^ b ( 

- exp \c J g(y)dyj = - exp \ c J g(y)dy 
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which follows by the fundamental theorem of calculus. By the induction hypothesis we 
have f(m) ^ H(m) for every 1 ^ m ^ £ — 1. Thus 

A f=i 
f(£) <: 1 - - + (X-l)(l + g{t-l))H{t-l) + Y,9(rn)H(m) 

m=l 

< 1 - \ + (A - l)H{l) + ^ exp [cj\{y)dy^ - h - 



= H{£) (a - 1 + ~)j + (l - ^ (1 - A) < 
as required, since - = 2 — A, - = A (l — — ) , and 1 < A < 2. □ 



We are now ready to prove Lemma HI 

Proof of Lemma\4\ Let /, g and h be as in the lemma. First we shall show that f(t) does 
not 'jump'. 

Claim 1 : For every t G N, 

< ±h{t-l)f(t-l) < /(f) < ?±h(t-l)f(t-l). 

Proof of Claim 1. We shall prove the claim by induction on t. For t — 1 we have 
/(l) = A/2, so the statement holds since h(0) = /(0) = 1. So let t ^ 2 and assume 
that the claim is true for 1, . . . , t — 1. The induction step follows from the following two 
easy subclaims. 

Subclaim 1 : For every 1 ^ m ^ t — 1 , 

a m h(t — m) f (t — m) ^ 3a m +i/i(i — m — l)/(f — m — 1). 

Proof of Subclaim 1. This follows from the induction hypothesis, and Lemma HOT a). To 
be precise, by the induction hypothesis we have 

f{t-m) > ^h(t-m- l)/(t-m- 1), 

since 1 ^ i — m — 1. Thus, by Lemma HOT a). 

a m f(t-m) > a m A > ^ 



a m+1 h(t -m - l)f(t - m - 1) a m+ i 2 
Since h(t — m) ^ 1, the subclaim follows. □ 
Subclaim 2 : For every 1 ^ m ^ £ — 1 , 

a m h(t — m) f (t — m) ^ 2 aihit — i) f (t — i) . 

i=m+l 
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Proof of Subclaim 2. We prove the subclaim by induction on t — m. When t — m = 1, 
then by Lemma HOT a). 

a t ^h{l)f{l) >^-(\)a t > 2a^(0)/(0), 



2 \\ t 

as required, since /(0) = h{0) = 1, /(l) = A/2 and /i(l) ^ 1. 

So let £ — m ^ 2, and assume the subclaim holds for t — m — 1. Then, 

t 

a m +ih{t - m - l)f(t - m - 1) ^ 2 ^2 a>ih(t — i)f(t -i), 

i=m+2 

and so, by Subclaim 1, 

a m h(t — m)f(t — m) ^ 3a m+ ih(t — m — l)f(t — m — 1) 

t 

> 2 aih(t-i)f{t-i), 

i=m+l 

as required. □ 
Now, by the induction hypothesis, f(t — i) > for all i G [t — 1]. Recall that 



/(*) = J2^ m+la ^ f - m ^ ,{t - m )> 



m=l 

and that a\ = A. Thus, by Subclaim 2, we have 

- A 
_ A ^(t - - 1)| < ^a m h(t-m)f(t-m) <: -h(t-l)f(t-l). 

m=2 

The induction step, and hence the claim, follows immediately. □ 

Lemma H] will follow from Claim 1 once we have re- written the recursion formula for 
f(t) in the following, more useful way. 



Claim 2 : For every 1 ^ t ^ £, 

fit) = i - ^ + J2 ( ( E(- 1 ) 4+l °*) - x ) w - m ^ - m ) 

t 

+ £((/i(f-m)-l)/(t-m)). 

m=l 

Proof of Claim 2. Consider the sum /(£) + /(£ — 1) + ... + /(0), and recall that 

m 

/(m) = ^(-l^Oi/iCm-O/Cm-i) 



i=i 
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for every m G [t\. The claim follows by substituting this sum for each f(m), and then 
grouping terms according to which f{m) they contain. To spell it out, 

t m 

m + ... + /(o) = /(o) + f(i) + Y,(- l ) i+l(x ^ m - *)^ m - ? ) 



m=2 i=l 

t-l / t-k 



/(0) + /(l) - fll /i(0)/(0) + £ h(k)f(k) 



k=0 \i=l 



A 



m=l \ i=l 



Subtracting /(£ — 1) + . . . + /(0), we obtain the claimed recursion. □ 

We are now ready to prove the claimed lower bound on f(£). To keep the formulae 
concise, for each m G N define b(m) = 'Yl™ =1 {— l) i+1 aj — 1. Recall that by Lemma 1401(c) 
and (e), we have b(m) > if m is odd, and 

\b{m)\ > ^|6(m + l)|. 
A 

Furthermore, f(t — m) ^ ^ — m — — m — 1), by Claim 1, and — m) ^ 1. 
Thus, if m is odd, 



^ ( ij^ m \i • - - 1 ) |6(m + 1)1 h(t - m - 1) f(t - m - 1) ^ 0, 



b(m)h(t — m)f(t — m) + 6(m + l)/i(t — m — l)/(£ — m — 1) 

|6(m)| A 
|6(m + 1)| ' 2 
Therefore 

i 

y2b(m)h(t-m)f(t-m) ^ 0, 

m=l 

since each pair of terms is positive, and if t is odd then the final term is also positive. 
Recall that h(t) ^ 1 and f(t) ^ for every t G N (by Claim 1), and note that therefore 

t-i 

V (h(t-m) - l)f(t-m) ^ 0. 

m=l 

Thus, by Claim 2, we have 

m > i - \ 

for every 1 ^ t ^ £. 

The upper bound follows from Lemma SH and the following claim. 

Claim 3 : Let t ^ 2. Then 

A <_1 

/(*) < i - g + + E/H^H- 

m=0 
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Proof of Claim 3. Recall that h(t — m) — 1 ^ g(t — m) for every m G [t — 1], so the claim 
will follow from Claim 2 if 

t 

^b(m)h(t-m)f(t-m) ^ b(l)h(t - l)f(t - 1). 

m=l 

But this follows as before. Indeed, if m is even, then 

b(m)h{t-m)f(t-m) + b(m + l)h{t - m - l)f(t - m - 1) ^ 

by the argument above, and if t is even then the last term is negative. □ 

Finally, recall that by Lemma HTJ ifl < A < 2, /(l) ^ A/2, and f(t) satisfies the 
recursion 

A <_1 

/(*) < i - g + (A-i)(i + ^-i))/(t-i) + &H«H, 

m=l 

for every 2 ^ t ^ £, then we have our desired upper bound on f(£). But g(0) = 0, 
6(1) = A — 1, and h(t — 1) ^ 1 + g(t — 1). Thus, by Claim 3, the conditions of Lemma SH 
hold, and the lemma follows. □ 
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